nd 


Cape 


BIOMETRIKA 


BIOMETRIKA 


FOUNDED BY 


W. Е. В. WELDON, FRANCIS GALTON axp KARL PEARSON 


MANAGING EDITOR 


E. S. PEARSON 


Ж айы < №9 “ 
ASSOCIATE EDITOR ar © жыш ле, В, № 
М. G. KENDALL p= any | 
M d pe 
“т, pr а 
татын. 


VOLUME 46 


ISSUED BY 


THE BIOMETRIKA OFFICE, UNIVERSITY COLLEGE. LONDON 


PRINTED AT THE UNIVERSITY PRESS, CAMBRIDGE 


PRINTED IN GREAT BRITAIN 


CONTENTS OF VOLUME 46 
Memoirs and. Miscellanea 


Влхтесілі, CELIA G. (See Pirar, К. С. S.) 

BannacrLovan, ELIZABETH D. and Pace, E. S. Tables for Wald tests for the mean 
of a normal distribution . р 

BARTHOLOMEW, D. J. А test of homogeneity for ordered altas natives 

BARTHOLOMEW, D. J. А test of homogeneity for ordered alternatives. IT 

Barron, D. E. and Davin, Е. №. Sequential oceupaney à à 

BuarE, D. Н. Approximation to the distribution of sample size for — 
tests. I. Tests of simple hypotheses : " Р қ 

Biccers, J. D. The estimation of missing and mixed-up ies ations in several 
experimental designs 

Boss, В. C. and Сорта, б. S. Moments of ondas statisti fom a — popolation 

Box, G. E. P. and Lucas, H. L. Design of experiments in non-linear situations . 

Возн, К. А. and Огкіх. I. Extrema of quadratic forms with applications to 
statistics я 3 ы i Р қ i ч А x | 

CrEMANS, К. б. Confidence limits in the case of the geometric distribution . 

Сохо, B. W. The busy period in relation to the queueing process GI/M/1 

Crow, E. L. and GanpxER, В. S. Confidence intervals for the expectation of a 
Poisson variable В 5 Я : В i қ . 

DannocH, J. N. The multiple-recapture census ‘UL Estimation when there is 
immigration or death ; . в . j қ ) 

Darwin, J. Н. Note оп а three-decision test for comparing two binomial — 

Darwin, J. H. Note on the comparison of several realizations of a Markoff chain 

Davin, F. N. The z-test and symmetrically distributed random variables 

Davin, Е. N. (See Barron, D. Е.) 

Davin, H. А. Tournaments and paired comparisons Я 7 : 

Dursin, J. Efficient estimation of parameters in moving-average models 

DURBIN, J. A note on the application of Quenouille’s method of bias reduction to 
the estimation of ratios . 

vAN ELTEREN, PH. and NogTHER, С. E. The asy ympiotie на сеа: x test фка 
balanced incomplete block design : 4 i е 

GABRIEL, К. В. The distribution of the — of suecesses in a sequence of 
dependent trials 

GARDNER, В. 5. (See Crow, E. L. ) 

Goopman, L. А. Partial tests for partial taus 

Gupra, S. S. (See Bose, В. C.) 

Натант, Е. А. Over flow at a traffic light Е | 

НАвтт.кү, Н. О. The efficiency of internal regression for the Ruine of the exponen- 
tial regression 

Нватносотв, С. В. and Mov J. к. Тһе —Á valk (in A іше айа 
its application to the theory of queues 


PAGE 


141 
336 
106 
412 
123 


139 
306 


293 


400 


- Contents 


Јонхѕох, №. L. On an extension of the connexion between Poisson. and AC 
distributions à à | а А . 4 

Jowerr, G. H. and ҰУмант. Werenpy. М. Jump analysis қ 

Kuarrr. С. G. On certain properties of power-series distributions ; 

Ктмвав, A. W. and [клси. Е. Approximate linearization of the incomplete 
/-function Р : қ е i Е j 

Lawry, D. N. Tests of significance in canonical analysis 

Leaca, E. (See KIMBALL, А. W.) ЖСН 

Leste, D. C. М. Determination of parameters in the Johnson system of probability 
distributions : : j à : қ 

LEVINE, J. Monomial-monomial symmetric function tables 

Lipton, S. (See PATTERSON, H. D.) 

Lrovp, D. Е. Note опа problem of estimation 

Lucas, Н. L. (See Box, G. E, Р.) 

Marrows, C. L. On the probability integral transformation 5 : , 

MERCER, А. and SMITH, С. S. А random walk in which the steps occur randomly 
in time Р " . : Е р д ; ; 4 3 в. 

Mites, В. E, Тһе complete amalgamation into blocks. by weighted means, of a 
finite set of real numbers : à . . . . . 

Міттох, В. С. and Мовслх, F. В. The design of factorial experiments: a survey 
of some schemes requiring not more than 256 treatment combinations 

Мовслх, Е. В. (See Міттох, В. G.) 

Movar, J. Е. (See Нкатнсоть. С. В.) . 

Nass, C. А. С. The y? test for small expectations in contingency tables. with 
Special reference to accidents and absenteeism 

Ховтнив, G. E. (See VAN ELTEREN, Pu.) 

Orxiw, I. (See Busu, К. А.) 

РАСНАВЕЗ, J, Table of the upper 10% points of the *studentized" range 

Рлав, Е. S. (See Bannacrovam. Ештавктн D.) 

PaTrTERSON, Н. D. and Ілртох, 5. 
fitting an exponential curve > ; А Е : . : 5 
PEARSON, E. <. Note on Mr Quenouille’s Edgeworth Type A transformation , 
PEARSON, Е, 8. Note on an approximation to the distribution of non-central y? 


і h 7 y ; Жет ial 
Рнилльз, A. W. The estimation of parameters т systems of stochastic differentia 
equations 


Рплат, К. C. $, Upper 
the sample mean : à ; 5 Е , . . : S se 

Риллт, К. С. 5. апа Banvecut, CELIA (i. On the distribution of the largest of six 
roots of a matrix in multivariate . . : қ 

Риллт, К, C, On Hotelling's generalization 72 


>. $ + . H 17 
Рил‚лт, К. С. S. and T IENZO. В.Р. On the distribution of the extreme studentized 
deviate from the sample mean 


QUENOUILLE, M. H. Table 
RAMASUBBAN. T. A. The 
distributions 


à Ba Hy or 
An investigation of Hartlev's method fi 


percentage points of the extreme studentized deviate fror 


analysis 
М. and Samson, P.. Jr, 


ürd distributions 
і = «ОП 
binomial and Poisso! 


$ of random Observations from stand 
generalized mean differences of the 


Contents 


Ало, C. г Some problems involvi ing linear hypotheses in multivariate analysis . 

Samson, P., Jr. (See Рпл,ат, К. С. М.) 

о м. On the non-central y? distribution à Я 1 z А 

Saw, J. G. Estimation of the normal population parameters given а singly 
censored sample . A 8 z 

SHENTON, L. R. The distribution of moment estimators 

міти, C. S. (See MERCER, А.) 

8мітн, W. L. On the cumulants of renewal processes р > 

Somers, В. H. The rank analogue of product-moment partial aetatibus and 
regression, with application to manifold, ordered contingency tables 

Srivastava, А. B. L. Effect of non- normality on the power of the analvsis of 
variance test . ` 

Тікх2о, В.Р. (See PILLAI K. C. S.) 

\Увтент, \Увхру M. (See Jowerr, G. H.) 


Book Reviews 


ANDERSON, O. Probleme der Statistischen Methodenlehre in 


den Sozialwissenschaften (Third edition) . л z Р C. A. MOSER 
Bers, L. Surveys im Applied Mathematics. Vol. 3. Mathe- 

matical Aspects of Subsonic and. T'ransonic Gas Dynamics С. L. Млшоуув 
Ввувктох, В. J. Н. and Horr, S. J. On the Dynamics of 

Exploited Fish Populations . : F. N. Davip 
Briss, C. І. Periodic Regression in БЙРЙ айй Ülimaislagy " F. N. Davip 
Boore, G. Calculus of Finite Differences $ Е. М. Davip 
Вовтхатох, В. S. and May, D. С. Handbook of Probability and 

Statistics with Tables қ ы Р . я F: N. Davip 
Сновснмам, С. W., Ackorr, В. L. and нон Е. L. Intro- 

duction to Operations Research. Е s T ; А ALISON Dore 
Crew, У. Experimental Designs in Industry . > 5 , N. L. Јонҳвоҳ 
Cox, D. R. Planı “4 ng of Experiments . . 5 > . С. L. Marrows 
Demográfia, Vol. 1, Part 1 . А x ‘ N. H. CARRIER 
Dunsors, P. H. Multivariate УУГ. Айй Е J. DURBIN 
Fisz, M. Wahrscheinlichkeitsrechnung und Mathematische 

Statistik . " š z - REGINA C. ELANDT 


FonsvmTHE, б. Е. and a Р: с. Беде іт Applied 
Mathematics. Vol. 5. Numerical Analysis ата Partial 


Differential Equations — . . C. L. Матлоуѕ 
Gass, S. I. Linear Programming: Methods dnd равна S. VAJDA 
GooDiER, J. N. and Hopaz, P. G. Surveys in Applied Mathe- 

matics. Vol. 1. Elasticity and Plasticity . қ В қ С. L. Мало\мз 
Goopman, В. Teach yourself Statistics . . . . i D. E. BARTON 
Gurra, H.. GwvrTHER, C. E. and Милек, J. C. P. Tables of 

Partitions А . : . : D. E. BARTON 


Нонх, Е. E. üenantaby Matriz Ямна " ! . à @ Ls Мати 


vii 
PAGE 
49 


498 
497 


viii Contents 


Hurcuinson, Str JosEPH. Genetics and Improvement of 


Tropical Crops — : А А 5 " à à А. В. G. OWEN 
Kapuansky, Г. et al, Surveys in Applied Mathematics. Vol. 4. 
Some Aspects of Analysis and Probability. Е Р C. L. МАШОМ 
KENDALL, М. С. and Sruarr, А. The Advanced Theory of 
Statistics. Vol. 1. Distribution Theory — . 4 : . Е. S. PEARSON 
Lemanis, E. and Мтховзку, №. Surveys in Applied. Mathe- 
matics. Vol. 2. Dynamics апа Nonlinear. Mechanics А С.І, MaLLOWS 
Li, J. С. В. Introduction to Statistical Inference . ; : F. N. DAVID 
Moonz, P. G. Principles of Statistical Techniques. A First 
Course from the beginnings for Schools and Universities . A. P. PENFOLD 
Morse, P. M. Queues, Inventories and Maintenance. . D. М.С. WISHART 
Orsoy, E. C. and Мплек, В. L. Morphological Integration . Е. М. DAVID 
Oscoop, W. Е. Functions of Real and Complex Variables . Т. ESTERMANN 
PANIZZON, G. Contribuito allo Studio delle Tavole di Nuzialita C. А. B. SMITH 
Реск, L. б. and HazeLwoop, В. N. Finite Queueing Tables. D. M. G. WISHART 
Риллт, К. C. S. Concise Tables for Statisticians. Е 1 D. E. BARTON 
РоРРЕВ, К. В. The Logic of Scientific Discovery . : i М. С. KENDALL 
QUENOUILLE, M. H. Analysis of Multiple Time-Series . Ё G. M. JENKINS 
Quzxovrrrs, M. Н. Fundamentals of Statistical Reasoning . C. L. MALLOWS 
Ктоврах, J. An Introduction to Combinatorial Analysis Я Е. №. Davip 
SALMON, G. А T'reatise on the Analytic Geometry of Three Dimensions 
(Seventh edition) Vol. 1; р Ж А , қ қ F. М. DAvID 
Sauzer, Н. E. et al. T'able for the Solution of Cubic Equations D. E. BARTON 
Statistical Research, M. onographs, Note regarding е ‘ ғ . 2 
TAYLOR, А. E. Introduction to F unctional Analysis ы as В. E. EDWARDS 
Tonaznsox, W. S. Theory and Methods of Scaling Ба 7% C. BURT 
Vazsoxvi, A. Scientific Programming in Business and I ndustry S. VAJDA 
Winnicx, L. American Housing and its Uses қ P . Marian BowLEY 
*. а 
, 
БЕ] 
‚т 


BIOMETRIKA PUBLICATIONS 


Issued by the Cambridge University Press, Bentley House, London, N.W. 1 


and obtainable from any bookseller 


Tables of the Incomplete B-Function Ерітер ву KARL PEARSON 
59 pages of Introduction and 494 pages of Tables Price: 55s. net 


Tables of the Incomplete r-Function EDITED By KARL PEARSON 
31 pages of Introduction and 164 pages of Tables Price: 42s. net 


Tables of the Complete and Incomplete Elliptic Integrals 
(from LEGENDRE's Traité des Fonctions Elliptiques. With autographed portrait of LEGENDRE) 
39 pages of Introduction by KARL PEARSON and 94 pages of Tables Price: 12s. 6d. net 


Tables of the Ordinates and Probability Integral of the Distribution of 
the Correlation Coefficient in Small Samples By F. N. DAVID 
38 pages of Introduction, 55 pages of Tables, 10 Diagrams and 4 Charts Price: 17s. 6d. net 


Biometrika Tables for Statisticians, Vol. I 
Ерітер BY E. S. PEARSON and Н. О. HARTLEY for the Biometrika Trust 


102 pages of Introduction and 136 pages of Tables Price: 25s. net 


The Life, Letters and Labours of Francis Galton, Vols. I, П, IIIA, & IIIS 
By KARL PEARSON, F.R.S. Price; £3. 3s. net 


Karl Pearson: An Appreciation of Some Aspects of his Life and Work 
By E. S. PEARSON Price: 15s. net 


A Bibliography of the Statistical and Other Writings of Karl Pearson 
COMPILED BY С. М. MORANT, with the assistance of B. L. WELCH Price: 6s. net 


« Student's" Collected Papers EDITED By E. S. PEARSON and 
JOHN WISHART with а FOREWORD BY LAUNCE McMULLEN Price: 21s. net 


Karl Pearson's Early Statistical Papers 


Reprinted by photo-lithography for the Biometrika Trust, with the permission of the original publishers. 
The Volume contains eleven papers, including the more important of the memoirs entitled “ Mathematical 
Contributions to the Theory of Evolution”, first published in the Philosophical Transactions of the Royal 
Society. The original paper deriving the y*-distribution, published in 1900 in the Philosophical Magazine, is 


also included. Price: 25s. net 


NEW STATISTICAL TABLES: SEPARATES RE-ISSUED 
FROM BIOMETRIKA 


To be obtained from 
BIOMETRIKA OFFICE, UNIVERSITY COLLEGE, LONDON, W.C.1 


1. From Biometrika, Vols. 22, 27 and 28 
Tests of Normality. By E. S. PEARSON and R. C. GEARY Price 2s. 64., post free 
Il. From Biometrika, Vol. 32, pp. 168-181 and 188-189 
(1) Table of percentage points of the incomplete beta-function 
(2) Table of percentage points of the x? distribution дай 
Stitched together with introductory matter. Price 25. 6d., post fr' 
Ш. From Biometrika, Vol. 32, pp. 300-310 
(1) Table of the probability integral of the range in samples from a normal population 
(2) Table of the percentage points of the range 
(3) Table of the percentage points of the t-distribution 1 
Stitched together with introductory matter. Price 2s. éd., post fre 
IV. From Biometrika, Vol. 33, pp. 73-88 
Table of percentage points of the inverted beta (F) distribution E 
With introductory matter. Price 2s. 6d., post fr 
V. From Biometrika, Vol, 33, pp. 252-265 


(1) Table of the 
population 


(2) Table of the Percentage points of the mean deviation 


ге 

Stitched together with introductory matter. Price 2s. 6d., post fr 

VI. From Biometrika, Vol, 33, pp. 296-304 
Table for testing th 


al 
Probability integral of the mean deviation in samples from a norm 


e homogeneity of a set of estimated variances 


e 

With introductory matter. Price 2s., post fre 

ҮП. From Biometrika, Vol. 35, pp. 145-156 

Table of significance levels for the 
tables. By D. J, FINNEY 


МШ. From Biometrika, Vol. 35, pp. 191—201 
Table for the calcu 
and W. L. STEVENS 


i 4 Т H ncy 
Fisher-Yates test of significance in 2x2 ipei 
With introductory matter. Price 2s. 6d., post 


У 
lation of working probits and weights in probit analysis. By D. J. FINNE 


e 
With introductory matter. Price 2s. 6d., post fre 
IX. From Biometrika, Vol, 36, pp. 267-289 


of autoregressive series, B 


Tables 
X, XIV, XVIII, and XX. From Bio 


-242 

‚ РР. 427-446, and Part 5 from Vol. 42, pp. 223-24 

NDALL ЪЁ 
With introductory matter. Price 145. 6d., post % 

(Part 1, 25. 6d.; Parts 2 and 3, 4s.; Part 4, 45.; Part 5, 

‚ Vol. 39, p. 190 and Vol. 43, pp. 449-451 

Ee points of the extreme “Studentized” 

By K. R. NAIR and H. A. DAVID 


XII. From Biometrika, Vol. 37, 

(1) Table of the Probabilit 

(2) Table of the y? inte 
and E. S. PEARSON 


deviate from the sample mean: 
With introductory matter. Price 1s., post fre? 
РР. 168-172 and РР. 313-325 
y integral of the t-distribution 
gral, and of the cumulative Poisson distribution. By Н. О. HARTLEY 
Stitched together with introductory matter. Price 5s., post free 


ИИ 


NEW STATISTICAL TABLES: continued 


XIII. From Biometrika, Vol. 38, pp. 112—130 
Charts of the power function for analysis of variance tests, derived from the non-central 


F-distribution. By E. S. PEARSON and H. O. HARTLEY 
With introductory matter. Price 2s. 64., post free 


XV. From Biometrika, Vol. 38, pp. 423-426 
A chart for the incomplete beta-function and the cumulative binomial distribution. By H. O. 
HARTLEY and E. R. FITCH With introductory matter and ruler scale. Price 2s. 6d., post free 


XVI. From Biometrika, Vol. 40, pp. 70-73 
Tables of the angular transformation. By W. L. STEVENS 
With introductory matter. Price 1s., post free 
XVII. From Biometrika, Vol. 40, pp. 74-86 
Tests of significance in a 2x 2 contingency table: extension of Finney's table (No. VII). 
Computed by R. LATSCHA With introductory matter. Price 2s. 6d., post free 


XIX. From Biometrika, Vol. 41, pp. 253-260 
Tables of generalized k-statistics. By S. Н. ABDEL-ATY With introductory matter. Price 25., post free 


XXI. From Biometrika, Vol. 42, pp. 494-511 
A new form of table for significance tests in a 2x2 contingency table. By P. ARMSEN 
With introductory matter. Price 2s. 6d., post free 
XXII. From Biometrika, Vol. 43, pp. 388-403 
Tables for certain applications of sequential methods in the analysis of variance. By W. D. RAY 
With introductory matter. Price 2s. 64., post free 
XXIII. From Biometrika, Vol.-43, рр. 423-435 
Table for determining confidence limits for a proportion in binomial sampling. By EDWIN L, 
CROW With introductory matter. Price 2s. 64., post free 


XXIV. From Biometrika, Vol. 44, pp. 411-419 
Tables for estimating the normal distribution function of normit analysis. Part І. Tables and 
description of their use. By JOSEPH BERKSON With introductory matter. Price 2s. éd., post free 


XXV. From Biometrika, Vol. 44, pp. 482-489 
Table of significance points for a two-sample t-test based on range. By P. G. MOORE 
With introductory matter. Price 2s. 6d., post free 
XXVI. From Biometrika, Vols. 44 & 45 
Tables of the upper percentage points of the generalized beta distribution. By F. G. FOSTER 
and D. H. REES With introductory matter. Price 5s., post free 


From Biometrika, Vol. 44, pp. 490-514 
A bibliography on the theory of queues. By ALISON DOIG Price 5s., post free 


From Biometrika, Vol. 45, pp. 293-315 
THOMAS BAYES'S Essay towards solving a problem in the doctrine of chances. [Reproduced 
from Phil. Trans. Roy. Soc. 1763, 53, 370-418.] With a biographical note by G. A. BARNARD. 


Price 5s., post free 


From Biometrika, Vol. 45, pp. 521-543 M | 
A bibliography on life testing and related topics. By WILLIAM MENDENHALL Price 5s., post free 


BIOMETRIKA INDEX. Comprising Subject Index for Vols. 1-37 and Author Index for Vols. 
1-40, with Author Index Supplement covering Vols. 41-43. Price 6s. or $1.00, post free 


Gii) 


STATISTICAL EXERCISES 


Issued by the DEPARTMENT OF STATISTICS 
UNIVERSITY COLLEGE, LONDON, W.C.1 


Part I. Elementary Statistical Exercises. Compiled by F. N. DAVID (91 pp-) 


These exercises, published in 1953, were collected in connection with the numerical classwork 
undertaken by students during the first year of the B.Sc. Special Degree 


course in Statistics at 
University College. They deal with applications of the more element 


ary univariate and bivariate 
Price: 6s. 6d. 


Part II. Statistical Exercises, Analysis of variance and associated techniques. — (107 pp.) 
Compiled by N. L. JOHNSON 
This volume includes over 100 e 


хегсіѕеѕ on the following topics: 
Analysis of variance techniq 


ues for the following experimental designs: randomized block, Latın 
square, confounding, split plots, fractional replication. Linear and curvilinear regression, Dosage- 
mortality techniques, Discriminant functions. Time series. Curve fitting. Price 125 


[A Supplementary volume containing suggested solutions to à substantial proportion of the exercises 
in Part II is in preparation.] 


t, London, W.C. 1. If ordered direct, payment must be made in 
heques made paya 


JOURNAL OF THE 
AMERICAN STATISTICAL ASSOCIATION 


VOL. 54, No, 285 CONTENTS Marcu 1959 


2 = +. . EPH b. 
imum Univariate Stratification. Токе DaLENIUS and JOSEP! 
Норсез, Jr.—Some Leading Sovi i 
Cross-Classifications : 


mments on *The Simplest Signed- 
Trelation Coefficient. Біснакр Н: 


BIOMETRICS 


Journal of the Biometric Society 


Vor. 15, No. 1 TABLE OF CONTENTS Marcu 1959 


Host Variability in Dilution Experiments. PETER AnMrrAGE—Restricted Selection Indices. OSCAR KEMPTHORNE 
and Акме W. NorpskocG—Equilibria in Auto-Tetraploids Under Natural Selection for a Simplified Model of 
Viabilities. P. А. PaRsoNs—The Analysis of а Non-Replicated Experiment Involving а Single Four-Course 
Rotation of Crops. Н. D. PATTERSON—The Analysis of a Two-Phase Experiment. В. М. CunNow—Analysis of 
Quadruple Rectangular Lattice Designs. Јонм Leroy Еогкѕ—Ап Examination of Some Methods of Comparing 
Several Rates or Proportions. MINDEL C. 5нер5--Тһе Analysis of Experiments on Growth Rate. Е. B. LEECH 
and М. J. В. HEALvy— Calculation of Chi-Square to Test the No Three-Factor Interaction Hypothesis. MARVIN A. 
KASTENBAUM and DONALD E. LAMPHIEAR—Extra-Period Change-Over Designs. H. D. PATTERSON and Н. L. 
LucAs—Query: Replication of Non-Center Points in the Rotatable and Near-Rotatable Central Composite 
Deum. G. E. P. Box—Query: Significance of Difference Between Two Non-Independent Correlation Coefficients. 
. 1. WILLIAMS. 


Vor. 15, No. 2 June 1959 
A Simple Method for Constructing Orthogonal Polynomials When the Independe i . 
D. S. Ковѕом—Тһе Estimation of Environmental and Genetic Trends ре аар equally браон, 
HENDERSON, OSCAR KEMPTHORNE, S. R. SEARLE and С. M. VON KrosiGkK—Experimental Design in the Evaluation 
of Genetic Parameters. ALAN ROBERTSON—A Distribution-Free Asymptotic Method of Estimating, Testing, and 
Setting Confidence Limits for Heritability. LORRAINE SCHWARTZ and STANLEY WEARDEN— The Regression 
Analysis of Causal Paths. MALCOLM E. TURNER and CHARLES D. STEVENS—A Class of Two Replicate Incomplete 
Block Designs. J. Rov—The Centric Systematic Area-Sample Treated as a Random Sample. A. MiLNE—Sensory 
Item Sorting. М. T. GRripGEMAN—Orthogonal Contrasts in Slope Ratio Investigations. P. J. CLARINGBOLD— Note: 
A Confidence Interval on the Abscissa of the Point of Intersection of Two Fitted Linear Regressions. MARVIN А. 
KASTENBAUM—Note: On the Estimation of the Mean of Poisson Distribution from a Sample with the Zero Class 
Missing. J. O. InwiN—Query: Differential Regression. JOHN T. WEBSTER. 


Biometrics is published quarterly. The annual non-member subscription rate is £7.00. Subscriptions, inquiries, and 
orders for back issues should be sent to: 
BIOMETRICS 


DEPARTMENT OF STATISTICS, VIRGINIA POLYTECHNIC INSTITUTE 
BLACKSBURG, VIRGINIA, U.S.A. 


TRABAJOS DE ESTADISTICA 


REVIEW PUBLISHED BY INSTITUTO DE INVESTIGACIONES ESTADÍSTICAS 
OF THE CONSEJO SUPERIOR DE INVESTIGACIONES CIENTÍFICAS 


MADRID, SPAIN 


Vol. IX. Cuad. III 
CONTENTS 


P. ZonoA—Convolución de histogramas. 

D. MARAVALL—La adición de vectores aleatorios isótropos en un espacio de М dimensiones. 
Noras. 

E. CANSADO—Sobre la inversión de matrices de Leontief. 


J. M. Garcia GarciA—Problema de la selección de la Cartera. 


CRONICA. 
BIBLIOGRAFIA. 
CUESTIONES Y EJERCICIOS. 


For everything in connection with works, exchanges and subscription write to Professor Sixto Rios, Instituto de 
Investigaciones Estadísticas, Consejo Superior de Investigaciones Cientificas, Serrano no. 123, Madrid, Spain. 

The Review is composed of three fascicles published three times a year (about 350 pp.), and its annual price is 
100 pesetas for Spain and $4.00 U.S.A. for all other countries. 


Annals of Human Genetics 
Formerly ANNALS OF EUGENICS 


Edited by L. S. PENROSE 


Vor. 23, Pr. 2 CONTENTS FrBRUARY 1959 
Testing association of metric traits and marker genes. Donoruv C. Lowry and Е. T. Scnut T7— Microcephaly 
in the Netherlands: a clinical and genetic study. J. VAN DEN Boscu— > 

of galactosemic children. H. N. KIRKMAN and E. BvNuM— Muscular dystrophy in childhood — the genetic aspect. 


HELEN BLYTH and В. Pucu— Distribution of birth weights of Chinese and Indian infants born in Singapore: 


birth. weight as an index of maturity. JEAN MILLIs—Frequency of twinning in poor Chinese in the maternity 


5 five populations. H, ELDON SUTTON, 
АРТЕК and L. E. TROMBLEY— Reviews. 


Mav 1959 


М. К. NAMBOODIRI апа V. 

approach to homozygosity. G. A. WATTERSON— 

S approach to homozygosity. G. A. WATTERSC ome comments on 

uman sex ratio. А. W, Е. EDwARbs— Some statistical data on atypical 

an serum. W. KALow and D. В. GUNN— The numbers of aunts and uncles of normal and 
1 children. С. Кмох--Тавіс sensitivity to phenyl-thio-urca in tuberculosis and diabetes 
ritance of celiac disease. C. Carter, W. SHELDON and C. Warkrn—The in- 

уріс population. A. С. SEARLE— Birth weight in Cystic fibrosis of the pancreas. 


mer's disease. Говмл WHEELAN—A note on the effects of method of ascertain- 
« C. A. B. Surru—Review. 


Subscription Price 1005. net per volume of four quarterly parts (in U.S.A, $17.50) post free. 
Single issues, 30s, (т U.S.A. $5.00) postage extra. 


CAMBRIDGE UNIVERSITY PRESS 
BENTLEY HOUSE, 200 EUSTON ROAD, LONDON, N.W. 1 


The Annals of Mathematical Statistics 


The Official Journal of the Institute of Mathematical Statistics 
VOL. 30, No, 1 CONTENTS MARCH, 1959 


OLIVE JEAN 
AFFEY— Bayes and Minimay 
—An Approximation Usefu 


—Notes: Ва 
n Functio 


ECONOMETRICA 


JOURNAL OF THE ECONOMETRIC SOCIETY 


Contents of Vol. 27, No. 1, January 1959, include: 


С. J. улм Ерк and J. SANDEE. Quantitative Determination of an Optimum Economic Policy. 
Немк THEIL. The Aggregation Implications of Identifiable Structural Macrorelations. 

ROBERT SoLow. Competitive Valuation in a Dynamic Input-Output System. 

LioNEL W. McKenzie, On the Existence of General Equilibrium for a Competitive Market. 

R. L. BASMANN. The Computation of Generalized Classical Estimates of Coefficients in a Structural Equation. 
KENNETH J. ARROW, Н. D. BLOCK and L. Hurwicz. On the Stability of Competitive Equilibrium. 
Н. F. LvpaLL. The Distribution of Employment Incomes. 

Murray BROWN. On the Structure of Stochastic Difference Equation Models. 

ROBERT SUMMERS. А Note on Least Squares Bias in Household Expenditure Analysis. 

S. J. Prats. A Comment. 

Book REVIEWS. 


Published Quarterly Subscription rate available on request 
The Econometric Society is an international society for the advancement of economic theory in its relation to 
statistics and mathematics. 


Subscriptions to Econometrica and inquiries about the work of the Society and the procedure in applying for 
membership should be addressed to 


RICHARD RUGGLES, Secretary : 
THE ECONOMETRIC SOCIETY, BOX 1264, YALE UNIVERSITY 
NEW HAVEN, CONNECTICUT, U.S.A. 


SANKHYA 


THE INDIAN JOURNAL OF STATISTICS 
EDITED BY P. C. MAHALANOBIS 


Vol. 20, parts 3 and 4, 1958 CONTENTS 

The Scope of Biological Statistics. J. B. S. HALDANE. 

Examples of Inconsistency of Maximum Likelihood Estimates. R. R. BAHADUR. 

Maximum Likelihood Estimation for the Multinomial Distribution with Infinite Number of Cells. C. RADHAK- 
RISHNA RAO. 

А Remark on Strong Measurability. V. S. VARADARAJAN. 

A Useful Convergence Theorem. V. S. VARADARAJAN. 

On Statistics Independent of Sufficient Statistics. D. Basu. | 

On Configurations and Non-Isomorphism of Some Incomplete Block Designs. M. ATIQUALLAH. 

Tables of Random Normal Deviates. J. M. SENGUPTA and NIKHILESH BHATTACHARYA. 

On Sampling With and Without Replacement. D. Basu. 

Diffusion by Discrete Movements. H. С. Gupta. 

On the Distribution of the Maximum Value of an Equally Correlated Sample from a Normal Population. A. Kupó. 

Hypergeometric Expansions for Incomplete Moments of the Bivariate Normal Distribution. A. R. KAMAT. 

Incomplete Moments of the Trivariate Normal Distribution. A. В. KAMAT. 

A Note About a Family of Perks’ Distribution. JOSEPH TALACKO. | 

Recent Experiments in Statistical Sampling in the Indian Statistical Institute. P. C. MAHALANORIS. 


SUBSCRIPTION CURRENT BACK NUMBERS 


per volume per issue per volume per issue 
INDIA Rs. 30 Rs. 10 Rs. 45 Rs. 12/8 
FOREIGN $10.00 $3.50 $15.00 $4.50 


Subscriptions and orders for back numbers should be sent to 


Statistical Publishing Society, 204/1 Barrackpore Trunk Road, Calcutta – 35 


(vii) 


ROYAL STATISTICAL SOCIETY 


THE JOURNAL OF THE ROYAL STATISTICAL SOCIETY is published in two series: SERIES А 
(GENERAL), four issues a year, annual subscription £4. 4s. post free; SERIEs B (METHODO- 
LOGICAL), two issues a year, annual subscription £3. 2s. post free. 


SERIES А (GENERAL), VOL. 122, PART І, 1959 


i i istributi гі iscussion) — Report of the 

he Long-term Trend in the Size Distribution of Income. Ву Н. Е. ураш. (with Discussion) рог! He 
hu on the Supply of and Demand for Statisticians—Introduced by Professor Е PEARSON (piii Біз 
cussion)—Size of Company and Other Factors in Dividend Policy. By P. SARGANT FLORENCE — Reviews o 5, 
Statistical and Current Notes, Additions to Library. 


ometric Distribution: By C. B. WINSTEN (with Discussion)— Behav iour Sequences 
Miei eie Chains. By VioLET В. CANE (with Discussion) —On a Multivariate Version of Fieller's H qw 
By B. M. Вемметт--Тһе Process Curve and the Equivalent Mixed Binomial with Two Components. М деін 
VAGHOLKAR—The Information in ап Experiment. By C. L. MALLows—The Behrens-Fisher Distribution 2 
Weighted Means. Ву G. S. JaMEs—The Estimation of Relationships with Autocorrcl 


of Instrumental Variables. By J. D. SarGAN—On Some Problems of Machine Interference. 
On the Distribution of Various Sums of Squ 


Correlated and Non-Homogeneous Errors 


Е. М. DAvip—The Limiting Frequencies of Integers with a Given Partitional Characteristic. By №, Е. ане 
Some Simple Duration-Dependent Stochastic Processes. By A. MERCER— Cyclic Queues with. Feedbac E ned 
P. D. FINCH—A Statistical Theory of Remnants. By J. Arrcuisox—Bandwidth and Resolvability in siars у: 
Spectral Analysis. Ву 2. А. LoMNick1 and 5. К. ZAREMBA—On a Property of Incomplete Blocks. By R. N eric 
JONES—A Renewal Problem with Bulk Ordering of Components. By D. R. Cox—The Dispersion of a dinde 
Species. By D. E. BARTON and Е. М. Davip—On a Discriminatory Problem Connected with the Works 0 loss 
By D. R. Cox and L. BRANDWoop—Experiments with Mixtures. By M. H. QuENOUILLE—A Different та 
Function for the Choice between Two Populations. By RITA J. MAURICE—Censored Observations in Randon dn 
Block Experiments. By M. R. SAMPFORD and J. TAvLoR—Corrigenda: Dilution Series: A Statistical Tes 
Technique. By W. L. STEVENS—The Regression Analysis of Binary Sequences. By D. R. Cox. 


ROYAL STATISTICAL SOCIETY, 21 BENTINCK STREET, LONDON, W.1 


APPLIED STATISTICS 


A JOURNAL OF THE ROYAL STATISTICAL SOCIETY 
VOLUME VII, No. 3 


CONTENTS Моувмвен 1958 
Analysis of Variance of a Randomised Block Design with Missing Observations. WitLIAM A. GLENN and CLYDE 
YOUNG KRAMER. 


Professor Hogben’s ‘ 


les of Sampling Inspection by Attributes. HuGo C. HAMAKER. 
The Lognormal Distribution in Quality Control. James Morrison, 
NOTES AND COMMENTS. 
MEETINGS ОЕ SECTIONS OF THE ROYAL STATISTICAL Society. 
Book REVIEWS AND PUBLICATIONS RECEIVED. 


=. 


VoLUME VIII, No. 1 


Some Aspects of Quality Control in 
The Use of the Same Experimental 


Marcu 1959 


n the Derivation of an Index of 


à Discontinued Series, K. S. BANERJEE. 
ntages. SIDNEY ROSENBAUM. 


A Significance Chart for Регсе 
NOTES AND COMMENTS. 
MEETINGS OF SECTIONS OF THE ROYAL 


BooK REVIEWS AND PUBLICATIONS ВЕ 


STATISTICAL SOCIETY. 
CEIVED. 


APPLIED STATISTICS is published three times per year. The annual subscription is £1. 10s. (U.S.A. and 
Canada $5.00). Single copies 13s. post free. Orders should be sent to 


Oliver and Boyd Ltd., Tweeddale Court, 14 High Street, Edinburgh, 1 


VOLUME 46, PARTS 1 AND 2 JUNE 1959 


ON THE CUMULANTS OF RENEWAL PROCESSES 


Ву WALTER L. SMITH} 


DEDICATED TO THE MEMORY OF JOHN WISHART 


1. INTRODUCTION 


Let X, X5, X,, ... be a renewal process; that is, a sequence of independent, non-negative, and 
identically distributed random variables which are not all zero with probability one. If 
there is a real, strictly positive à such that, with probability one, all the Х, are divisible by 6, 
then we shall call the renewal process discrete. A renewal process which is not discrete is 
termed continuous. Write F(.) for the distribution function of X, and д, = EXS оо), 
S, = X+ Xa +... +X; and Ё(.) for the distribution function of S,. 

Renewal theory is concerned primarily with the random variable N, which is defined to be 
the maximum suffix Ё for which S; < t, subject to the convention N, = 0 if X, > t. In the 
obvious renewal application the X; represent successive lifetimes of the object being 
renewed, and № is the number of renewals made by time # (subject to the original object 
having been installed at time 0). However, renewal theory has proved to be of significance 
in a wide range of investigations. We mention the theory of queues (Smith, 1953; Beneš, 
1957), the theory of electronic counters (Takács, 1956; Smith, 1957), the theory of regenera- 
tive stochastic processes (Smith, 1955), the study of the variance-time curve, in its applica- 
tion to problems of nerve physiology (Cox & Smith, 1954; Cox, 1955), and the theory of 
cricket scores (Skellam & Shenton, 1957). 

Previous theoretical work on renewal theory has been concerned almost exclusively (for 
very good reasons) with the first moment №. We refer to papers by Feller (1941), (1949), 
Tacklind (1945), Blackwell (1948), and Smith (1954). A typical result concerning &N,is that 
if the renewal process is continuous and > < oo, then as t + +00, 

iti La. (а- | +0(1). 
ға 1 
This theorem was first proved, under slightly unnecessary restrictions, by Tücklind (1945), 
and in its final form by Smith (1954) (see also, Smith, 1957). 

The study of higher moments of N, seems to have been neglected, apart from a rather non- 
rigorous and short account by Skellam & Shenton (1957). Feller (1949) proves a theorem 
concerning the variance of №, in the discrete renewal process, and Smith (1954) proves the 
corresponding result for continuous renewal processes: if и» < со then as t > +оо 

2 
н vaN = t 0), 
ға 
A result of Cox & Smith (1954) concerning a related ‘equilibrium’ renewal process also 
suggests that if jj < 00 then there are constants А and В such that as t+ +оо, 
var N, = 41+ В +0(1). A proof of such a result does not seem to be available, and nothing 
appears to have been established concerning the higher cumulants of Л). It is with these 


higher cumulants that the present paper is concerned. 


T This paper was prepared with the partial support of the Offiee of Naval Research. 
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The theorems concerning ФА) which are mentioned above have all been proved without 
irrelevant assumptions concerning Ё(.). However, reference to Smith (1954) will show that 
а very special type of argument was needed to prove that var N, ~ At, which shows no prose 
pect of generalizing to higher moments of №, nor of yielding the linear approximation 
At 4- B. Write 4 for the class of distribution functions F(.)for which F,(.), for some finite k, 
has an absolutely continuous component. Тһе present discussion of the higher eum ulants of 
N, for continuous renewal processes, will be based on the weak, but extraneous, assumption 
that F(.)e 4". This assumption is assumed to hold throughout this paper even when not men- 
tioned. It will be noticed that once this extra assumption has been made the investigation 
proceeds with little real difficulty. Moreover, the fact that I(.)e4 will be found to imply 
certain variational properties of remainder terms that arise. 

Thepaperfallsnaturally into two parts. The first part ($$ 2-5) is concerned with the rigorous 
proof that the nth cumulant of N, has a linear asymptotic form, under general conditions. 
Two theorems (Theorems 3 and 4) are proved, which are jointly equivalent to the following: 


THEOREM 0. If F(.)e.? and Ира < ©, р > 0, then there exist constants a, and b, such 
that the nth cumulant of N, is given by 
A(t) 
t cur T, АИ 
а, ton ta +02’ 


where A(t) is of bounded total variation, is o(1) as t > + оо, satisfies te condition 
A(t) - A(t—a) = o(t) 


as t > +00 for every fixed о > 0, and when p > 1 has the additional property that 2(0/(1--0 
belongs to the class Гах ` 
This theorem substantially generalizes Theorems 8, 9, and 10 of Smith (1954) which were 
concerned with different aspects of Theorem 0 in the case n = 1. в 
The second part ofthe paper ($$ 6, 7) is concerned with the evaluation of a,, and b, in term 
of the y,. Their values are given for n = 1,2,...,8. { 
In $2 we introduce certain unconventional moments and cumulants which are more 
convenient in the present investigation than the conventional ones, and we study thet 
Properties. In $3 а number of preliminary lemmas are proved, including in particular 
Lemma 4 which implies a property of the remainder term in Taylor expansions of charac 
teristic functions which we believe to be of independent interest, In $4 we develop а sy™ 
bolic calculus for dealing with complicated Laplace transforms that arise and use this 
calculus to prove Theorems 1 and 2 concerning the asymptotic behaviour of moments 9 
М. These results are preparatory to $5 in which Theorem 0, above, is essentially proved- 
In $6 we show how the а, and b,, mentioned above, may be computed. The method W°? 
used is described in sufficient detail to make it possible to extend the list of formulae give? 
for (a,) and (b,), if required. In $7 various checking systems are diseussed which assure Lu 
that the calculations of $ б are correct. A special case which we will quote here, and which 19 


а consequence of Theorem 0 and the computations in $$ 6 and 7, is as follows: If из < oo then 
аві-> + со, 


-g 508 2 
var N, 2 ға, | ( H2 _ 24% h) 
SEC А 3R м) +901). 
The asymptotic linearity of the cu 


mulants of № implies a curious combinatorial identity, 
which we have been unable to prove 


directly. We discuss this identity in an appendix. If it 
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could be established by a direct and elementary argument, then part of the proof of Theorem 0 


could be drastically simplified. 
Notice that this entire investigation is exclusively concerned with continuous renewal 


processes. Nevertheless, there is a parallel treatment for the discrete process, in which the 
requirement Ё(.)е.Я can (and, of course, must) be dropped. 

Тһе author is grateful to his friend Dennis V. Lindley, who asked the questions which 
provoked the present work. He also wishes to thank Professor J. Neyman for making it 
possible for him to spend the summer of 1957 at Berkeley, where the bulk of this paper was 
prepared. 

9. SOME MOMENT FORMULAE 
Define the rth conventional moment m,(t) = «Ур and write k,(t) for the corresponding 
conventional cumulant of order r. These conventional moments and cumulants are not the 
most suitable for a discussion of №. Instead, we shall use the moments 
дй) = 6(M +1) (N+ 2) -e (ї+®) (2-1) 


which come from the generating function қ 
Ф(6) =E a-ge 


The cumulants /,() are obtained by а formal power series expansion. of the cumulant 
generating function W(5) = log Ф,(6) in the usual way. Thus, the relations between the 
¢-moments and the y-cumulants are exactly similar to the corresponding relations between 
the conventional moments m,(/) and the conventional cumulants k,(t). It is evident, from 


тыыны $0 = mi +1, 
фз( = m(t) + 3n4() + 2, 
Galt) = malt) + Omalt) + Llm,(t) +6, 
p(t) = my(t) + 10m,(t) + 35т(0) + 50m,(t) + 24. 


In fact, it can easily be shown that 


lt) = [SPEEL malt) 157 
umbers of the first kind ( 
f equations (2:2) one can 


ga] My a(t) + [SRF] maa) +... + [Sia]. (2-2) 


irling’ Jordan, 1947, p. 142). 
нн о deduce the cwm relations 
m(t) = $0 — b 
malt) = $ — 39,() * 
22 malt) = 90) — 6020 + 7,(t)—1, 
m(t) = $40 — 1060) + 29«(0) — 15100 +1. 
However, itis not difficult to deduce from the defining property ofthe Stirling numbers, €, 
of the кесни kind (Jordan, 1947, р. 168) that the general relation is 
m) = €19.0— Sta Py all) #(- 0" qna (2-3) 
Ee: for the conventional moment generating function, and 
ulant generating function. It can be shown that 


-2--“ (1-63. Thus we have 


Let us write (2) = | 
К(г) = log M,(z) for the corresponding cum 
Mz) = e=0,(1— 07°) and hence that Kz) = 

Toli ne 
per Dana... = а 0D -67l 


1-2 


Pa араа (2-4) 
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А result of Jordan (1947, p. 202) shows that 


eS (2:5) 


and from (2:4) and (2:5) we deduce the following relations between conventional cumulants 
Lk, (t) and the y-cumulants V, (t). 


k(t) = (t) - 1, | (2-6) 
) 


Ea (O = Shpa Бату, (+. (163440) (m > 2). 


For reference we list the following: 


ЕД) = Vra (t) — Vr, (t), b 
8) = Vs (0) — 3/7. (0) +4, (0, (pu 
Est) = Valt) — 60300) + Ts (t) — v, (t). 
Finally, to complete our discussion of the relations between the k,,(t) and the у, (t), let us 
i " 2.8 
ли EO = log (1 — + K(— log [1 £]). (2:8) 
If we use the following formula of Jordan ((1947), p. 202), 


kantor. 5 вт, 
=m - 
then we can deduce from (2-8) that " 


Yalt) = |80) [S771] E, 100) +... + [S2] Ie (t) + [S7] 16,00) — 1]. 


Whilst this last formula is occasionally useful for computational checks it will have no 
explicit use in the present paper. 


*Q(s) = ІМ e-dO(t) 


for the Laplace-Stieltjes transform (L. 


-S.T.) of Q(t). If Q(t) is any function which is inte- 
grable in every finite interva] then we 


write 


when Q(t) has both а L.-S.T. and a L.T. then *Q(s) = s, Q(5). 
* (s) undoubtedly include much that is known. How- 
ilable a satisfactory and complete treatment, dealing 
ace transform, which leads to the somewhat surprising 
we give the following proofs in some detail. 
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LEMMA 1. Ги, < oo and 
T 
08 (x) = == | en dz 
0 


then, as s > 0+, I 68 (x) d F(x) = o(s). 
0- 
Proof. For any в > 0 we can find an a(c) such that 


L z"dF(z) < e, 


To 


т 
віпсе И» < oo. But OS (x) < <" Г e-* dz 
< 2"/s. 
Hence, 05 (z) AF (z2) < els. (3-1) 
To 
211 а е-з%| 
Also, for a < ду, е 0% (=) < саш. 


and for s < 80(20(6), 6), а [1 — e 77] < в. 


Thus li “ауаға < ele. (3-2) 


The lemma follows from (3:1) and (3:2). 


Lemma 2. 
о?а? (— ах)" (= 0)" ota) - 
xl фе азат” ар қ ш 


Proof. Тһе L.T. of the right-hand side of (3-3) is 


1 а о? (ед, (ers 14 
> ae us oer 1+1 (s+a) sn sta 


Lemma 3. А necessary and sufficient condition for F(x) to have its first n moments finite is 
E; , 


that, for real s > 0, *F(s) have an expansion | 
158 (-8 ? ln 
*F(s) = 1-454 5-— ta + o(s"). 


Proof. Suppose first that а, gy «++» Hn Те finite. Then, by Lemmas 1 and 2 we have 
roof. slo 52 


*F(s) = I ed F(2) 


—8)" (= 8)" " 
= |" онай C n јан 


xin 
lo ә £ EN n+ o 1). 
= 1-8 Rr Fy | 8)” +8 (; 
i ssity half of the theorem. 
bee ны re part define the shift ү, Е" л the rule E^g(s) = g(s + A), for 
TOV: ^ her 
i ivative of *F(s) on the right is given by 
any function g(s). Then the kth deriva и 
w TE 
Е *) *F(s). 


x* pens) = E ГЕ. 
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If the expansion of *F (s) announced in the statement of our lemma holds, then for k < n we 


have * PUNO) = и, < оо. 
E^—1 k fa 
However, *pk+ (s) = E ( Ё ) І e-* dF(z), 
һ=0+ = 
© —һг _ k 
=m І ( 1 ) e-** d F(z). 
h-04- = 
© hs. k 
Thus m= Lt [ ( sk ) zd F(z) 


where cis chosen so that for all 0 < 2 < ewe have 1 — e~? > 1z. The finiteness of jz, (regarded, 
temporarily, purely as a coefficient in the given expansion of *Г(8)) establishes the finiteness 


of © 
[ zkdF(z). 


ГЕММА 4. A necessary and sufficient condition for the distribution function F(x) of a non- 


negative random variable to have its first k moments finite is that there exists another distributio 
function Fay) such that 


Мә о lh 1 Ёк 
*F(s) = 1—8 ев — us Di (—5)® ‘er (—s)* * Fas). 
$ 5 
Proof. Obviously, if the above expansion is possible, then Lemma 3 and the fact tha 
*Foy(s) = 1 +0(1) азз— 0+ prove the finiteness of the first k moments of F(x). e 
Now suppose that we are merely given the fact that jj, < оо. We shall assume Ё prm М 
the lemma is trivial if = 0. If the mean у, is finite then we can define a new distributio 
function 1% 
Folz) = — | {1— F(z)) dz, 
Py ЈО 


with L.-S.T. ы) 


*Fo(s) = m 
1 
From Lemma 3 we see that 


*P(s) = 1—pys+ est — ... EP (sje оба), 


k! 
so that “Рр md y Дз а. JE (дук k-1 
(8) By, tae,” d rx 8)*1 + o(sk-1), 
It then follows, 


also from Lemma 3, that the first (2-1) moments of F(x) are finite, and 
ese is given by /1,,1/(r +1) Hı. Notice, moreover, that 


*F(s) = 1—5,5*Fqy(s). 


We can now repeat the process, e.g. define 


that the rth of th 


Е (т) = ЗА [а — Fay(z)) йг, 
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and obtain a new distribution function (ж) whose first (I — 2) moments are finite, and given 
by the expansion 


* 203 204 o 2и к—° РА 
Foals) = l-am a ndi [жау 2+ o(sk-2), 


For this second ‘derived’ distribution function F(x) we have 
*F(s) = 1— 44s nr * fos). 


Evidently, repeated derivations of K(x), F(x), ..., Fuj(2), will prove the lemma. 
Whilst strictly irrelevant to the present discussion, the remark is worth making that 
Lemma 4 is capable of trivial modification to include distributions of unrestricted (i.e. not 


necessarily positive) random variables. A final preparatory result is the following. 
Lemma 5. If Q(x) is any function of bounded (total) variation (B.V.) then 
*Q(s) = o(s) as 8->0-. 


Proof. (—s)*QUA(s) = І L e-**(s)* Оа). 


But, as s > 0+, e~**(sx)* — 0, boundedly. This proves the lemma. 


4. THE ASYMPTOTIC FORM OF ¢,(¢) 
We need first 


Lond ЧАЯ НШІ _ а 
Proof. Since ф,(0) is evidently а non-decreasing function oft it is legitimate to consider the 


L.-8.T. #ф,(8). 
Now palt) = EN +1) (2+)... Mm) 
= $ (+1)(Е+ 2)... (0) (0 — FeO}. 
k=0 
Thus б (в) = > (&+1) (6+2)... (Ё+ n) ([F(s)]* — [*7(5)]*9) 


k=0 


= 
= т!{1 108) X, 
The lemma follows at once. 
We remark that it is the particularly simple form of the L.-S.T. *$, (s) that makes ¢,,(t) so 
much more convenient, for our present study, than the conventional moment m(t). 
Provided д, < со we can consider the function 
m * Fs) 


"40 = та C 


(6+1) ы +) ts pt. 


have the following three lemmas 


Concerning which we ; 
bounded total variation, and 


Тлммл 7. If Р) «5 and ш.а < co then Knl) is of 
= a. 
К„(со) = G41) abn 
5 tal variation follows from the fact that if 
Proof. t that K,,(t) is of bounded to i 
4 ons звао both F(t) and Fop(t) are finite, and from Lemma 6 of Smith 
iud is found, from the Taylor expansion of *Foy(s), to bez, al (n + 1) Ы 


19 £F 
id E nian ae un «К, (s) we deduce that *K,(0) = К.,(00) has the value announced. 
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LEMMA 8. ина < ©, so that *K, , (s) exists, then 


*KEQG) _ Инза 


1 TS 4-1) 
8 (п--1)ш, [s Т) 


Proof. We know that 


E Ға л 
* Ios) = EC irm SK (8). 


But the right-hand side of (4-1) is equal to 


M+ 1)j, 1— *F(s) 


База * Fas) Те *Fay(s) ше LS e] а= P431 Қаз ($) 
(+1), |1l—*F(s) 1- * (3) 1 — *F(s) ( 


The lemma is thus evident. 


LEMMA 9. 
-— 
“0 = оек). 


Proof. Direct computation establishes the lemma. 

So far we have been able to deal exactly and precisely with the functions and coefficients 
which have arisen. If we were to attack in this way the theorems which we now approach We 
would rapidly become involved in hopeless complication. Our primary objective in this 
Section is merely to establish that ¢,,(t) has a polynomial form with a remainder term 0 
& certain type. At this stage we shall therefore introduce, and make considerable use of 
certain notations which will preserve the essence of the various relations in which we have 
interest, without preserving a great deal of undesired and heavy detail. We shall write У for 
any constant, and y, for any finite rational function of Иль о, -+> Hn. For instance, we woul 
replace ji jt, by Уз. However, nothing is to stop us calling и и, by the name уу», or у, if bes 
want to. We shall also write * Mg, for any multinomial expression in *K,, *K,, ..., Ки» with 
coefficients Уш and with the maximum degree of any term not greater than d. We shall 2180 
Insist, when w > 1, that the degree of *K,, in any term is not to exceed unity. For examp. ue 
of the use of this symbolism we offer the following. 

(i) 

(ii) 


My *KS*K, + fg*K3 = * МЗ, 
My *Ko* KE + ш*Кь = *M$. 


a function of B.V. Incidentally, 


dE it is easy to see from Lemm® 
» Which is the same thing as * M2(0) 


» the total net variation of M а (t), is simply 
Уи: 
Expressions such as Ya 


* Mi, ete., may occur several times in the same equation; but it 
must not be supposed that 


at each appearance they represent the same function. Howeve™ 
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the following laws of combination should be noticed. If r 2 max (p,q) and d = max (b, c) 
then 4 ni 


Yp t Ya = Yn (C1) 
Ура = Yr (C2) 
* Mb +*Mg = +8, (03) 


*М5* 11° МЕ if p=q+ ] m 


= *M* otherwise. 


Тһе slight complication in rule (C4) arises from the fact that in a multinomial * М4 with 
ш>1,а%К, may not have a higher degree than unity in any term. When we multiply two 
multinomials, of the same weight w, it is possible for a term containing a factor *K® to arise. 


It should be clear that we may consistently write 
* M. 0, = Уи- 
Lemma 10. 1/4 > 1, then 
жуа 
tts = Tony XM Fe 
Proof. А typical term in * M% could be written as ,, “Kp, *K p, -.. *К р,» wherer < d and 


every р; < w. This term could be written, alternatively, as *&,, *M$ *. But Lemma 8 states, 


essentially, that K 
a Pi — Ta +* Mhr 


Thus, remembering that p, < w, we have by (C4) that 


К “ме = Ушта. MET + EME д. 
8 8 


From this symbolic reduction of a typical term іп * MÀ it is thus apparent that 


x fd «uct 
Me уа "Ин (42) 


If d = 1 then the lemma is proved, since *M®, = Yu: If not, we re-apply the above argu- 


ment to *M2-!, and deduce 


- x ]yd-2 
«ме Mu + Ма. (4-3) 


——— = Уш+ї n 


On combining (4-2) and (4-3), and using (02) and (C3), we have 


ua 


4-2 
®М Mw жуба Мы Yon + Мал. (4-4) 
3 Уан”; 8 


If d = 9 the lemma is proved at this stage, and it should now be clear that repeated 
reductions will eventually prove the lemma, whatever the value of d. 
In our discussions of remainder terms we shall find it convenient to use the following 


Classes of functions, which we now define. All functions, as has been stated before, are 


assumed to vanish for negative argument. 
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DEFINITION 1. The function A(t) belongs to the class Z if and only if it is B.V., tends to 
zero as t approaches + co, and satisfies the condition 


A(t) -A(t— x) = 001—1), (9) 
as t — +00, for every о > 0. 


DzriNrTION 2. The function A(t) belongs to the class Z if and only if A(é)/(1 +t) belongs 
to L. 
It will be convenient to write Z = 2 n F for the class of all those functions in both @ and 
-£. An easy result which we use later is 
If А є@ and A,()e2, then Aj(t)Ag(t)eZ. (C8) 
(In fact Z is a commutative algebra over the reals and (C5) is included in the remark that 
Я is an ideal in Z. However, these concepts will find no application in the present work-) 
Гемма 11. Ги, < 00, w> 1, then 
(i) Mt) = o(t) as t > co, 
(ii) whenever G(t) is B.V., the function P(t) whose L.-S.T. is *G* M1 satisfies the conditions 
Р()-Р(-а)-о(1) as t— co, for every а > 0. = 
Proof. Terms in * M2 may contain *K, to at most the first power so that we can evidently 
write *M7 as *K,,*J +*L, where *J and *L contain only +, terms with p < w. р 
М < 0, it follows that J(t) and L(t) are both functions of B.V. Moreover, if we wr! И 
YP = *J{1— Ку} then Y(t) is also a function of B.V. and is, indeed, one which vanishes 9 
+00 (since * (0 --) is obviously zero). Rewriting *M¢ as *L -- =” *d, we see that 


t 
Ме) = Lit) + | V (t— t") d, (t), 
0 
and the fact that M(t) = o(t) now follows from Lemma 1 of Smith (1955). ig 


М = prove the second part of the lemma we notice that the L.-S.T. of Р()-Р (t-a) ^ 
simply 


{1-е} *9* Ма = 1-е} *g *r, 4 a “беу, 
Which we recognize, by Lemma 6 of Smith (1954), to be the L.-S.T. of a function of B.V: 


Evidently this function approaches zero as > +00, as may be seen by letting s > 0+. ue 
the lemma is proved. 


LEMMA 12, For any non-negative integer p, define *g, (s) = s-» * Mà. Then, if шана < ы. 
w > 1, we can find a Уш+а Such that t7?g,(0) — уд belongs to Z. 
Proof. Evidently 


t 

900) = | (£—z)? 4М4(), 

Where M% is B.V. Hence : 

т=р [ГЇ gy 

un Мы + 2, 0 0 Чи. 
This suggests that w 

Since M42(t) is B.V. we 

we can prove that 


e take for the y,,, referred to in the enunciation the value Jf: 20490), 
Shall have proved that t?g,(t) — М4(--со)ів B.V., and o(1) at + 


мй = s ave) 
0 


is B.V. and vanishes at + 00, for every integer r > 0, and every distribution function Ү(@), 


| 
| 
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Choose є > 0, and take T, so that Т(Т,) > 1—e. Then for all > T, > eZ, it is clear that 


1 4 
10 < ж | ^1 rdV(z)-- fare 


0 


< 6-е. 


Thus 1,4) does in fact vanish at +00. Moreover, an integration by parts will quickly verify 


that 5 
l(t) = V(t)- (ЕЕЕ 
0: 8 


But, because 1,(1) is non-negative, this displays 1,4) to be the difference between two in- 
creasing functions which must, since V(t) < 1, be bounded. Hence l(t) is B.V. 

To finish the proof we must show that the condition (*) used in Definition 1 is satisfied by 
tg (D. To this end we remark that an easy deduction from Lemma 10 is that 


жМа Mi(+00) «Иа, 
8р s? 8р1 c 


m 


(4) L t 2 2-1 Ма 
Непсе Lx — M2(--o0) = ғ ik (: Е dM tas), (4-5) 


where M2 (2) cannot be assumed to be B.V. However, let us define 


шы 
m(t) = [б 4 амо (2). (4-6) 


0 


We claim that the proof will be complete if we prove that m(t) satisfies the condition 


п) та) = 01) ав 4-- +00, for every а > 0. (%%) 


art of this proof, we have already shown that m(t) = o(t) as 


For, by (4-5) and the first p ) = o(1), and so, if (**) holds then (*) is satisfied by m(t)/t 


t+ +оо. Thus (1 —#/(¢—a)} m( — 
ich i ired). | 
j e er neon we can now see from (4-6) that all we need prove is that 


if, for any r > 0, we define in , 
РНЕ 


Р айға 
1. an Gdentiesl with one performed already inthis proof) above а 
d 
мй у mela, = NERD. (#7) 
E Л 


ht-hand side of (47) tends to zero ast ^ +00, and so we 


Lemma 11 (i) has proved that therig of Mercer's theorem (Paley & Wiener (1934) p. 58) that 


сап deduce from the integral version 
п.) = . Hence 
Du lim | Ar) де = 0. (4-8) 
im 2 
i-o /1-а 
sfies (**) follows from the resulting equation, 


н 4) sati 
Multiply (4-7) by t. Then the fact that n,(t) which was proved in Lemma 11 (ii). 


(4-8), and the result that Ито satisfies (**), 
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THEOREM 1. [Гилл < © then | 
ФА = y,U уай... ыу, ns + Olt), 
where ot) e Z. 
Proof. By Lemma 6 


*b,(8) = n! [*0,(s)" 
=n! Е к а by Lemma 9, 


2n, *M і ЖИ? 


ж n 
S"! gn-l guog € aen An. | 


(It is to be recalled that the rule excluding powers of *K,, greater than unity in * M4, does 
not apply when w — 1.) 


Let us now apply Lemma 10 repeatedly. We deduce that | 
ж 2 
tgal) = 178 TAS 


* Мп 
8"  gn-l  gn-2 "esse 213 Е 


Yi „бв „И i 
FE gn taa gn-2 Жаа Teck * M$, 


= etc., 
9 
s" 8—1 8 


Thus it follows that 
Palt) = Vil" у" +... +Y t+ Мі). 


The property claimed for the remainder w 


> which 
(t) follows immediately from Lemma 12, in W hi 
we let p = 0. 


Lemma 13. If uis < ©, т > 0, p > 0, then 
ж po») 
yp 7%) 
Ир 
is the L.-S.T. of a distribution function F(x), say, 


whose first n moments are finite. 
Proof. Evidently 


(—1)? *F0(s) = | ” esta? ар (а), 
and so, if we put АЕ 


аға) = аа) im | 
the lemma is obvious. " 


THEOREM 2. If hasip < ©, т>0, p»0,then 


Bal) = PEAS Еу er | 
where A(t) e 4. ш 
Remark. 


: t 
' This theorem bears a close superficial resemblance to the previous theorem. pu е 
provides us 


with much more information about the asymptotie behaviour of the remain й 
term, and requires a somewhat more intricate proof. We need the extra information CO” 


cerning the remainder term if we are to discuss the asymptotic behaviour of the cumulant 
which involve products of moments. 
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Proof. The proof is based on the fact that ( i 
at ( — 1)? .0(PX(s) is the L.T. of іо, (t). Н 
before we enter the main part of this proof we shall establish the о abe ick 


рур PWs) = — Ya Yau, (1 
( P sou) dtum Pee ro |» (4-10) 


То prove this preliminary result, we observe that Lemma 10 leads easily to 


* Ma 


- Ув Р 
= ^ tiat Тыва eeu SP an LE ME, (4-11) 


However, Lemma 5 and some repeated differentiation will demonstrate that 


di? 1 
EM p-l*y n = S 
( 5) 5 Гар 49. 


so that we can deduce from (4-11) the fact that 


ау» “Му Бен Тан 1 
(-Ә 5 = bue). (412) 


4-10) then follows at once from (4-9) and (4-12). 


The preliminary result ( 
ain part of the proof we remark that by Lemma 6 we can write 


Turning now to the m 
т! 1 Я 
x95) = 5 р m zal ы 


is the sum of a number of terms like 
x poX(s).. *FOn(s) 
ГУ на =)" 
In such а term, Г, the number of factors in the numerator is 7, which is also the increase 
above n of the power of {= *F(s)} in the denominator. The multiplier y will depend only on 
т, and not on the moments /44, 4а» >: ete. Furthermore, one must have 
httecti-P 
r+t <p, 
maxj; « p-r-t*l. 
i 


so that (— 1)» Ø) 


80 that 
and 
To substantiate these claims, consider what happens if Г is differentiated once with respect 
to s. We obtain a number of terms: 
(а) one by increasing ¢ by 1; 
(b) r terms by increasing each j; 
(c) one term by increasing r by 
numerator. 
It should now be evident that an inductive 
tions concerning I’. 
We shall next rewrite 


by 1, in turn; 


] in the denominator and adding a factor *F® to the 


argument will prove the correctness of our asser- 


T, using Lemmas 4, 6 and 13, in the form 


А " jg f t 
T =p x pu +02)... + Pd {* Ку} филе 
This equation can be written, more simply, a$ 


T = Yp *G(s) iners (4:13) 
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where G is a distribution function whose first | 
(n+p+1)—(p—r—-t+1)=n4+r+t | 


moments are finite and representable as y,,,,,. A consequence of this is that, whenever 
p<n-+r-+t, we can use Lemma 4 to infer that 


*G(s) 1 Үл+р+1 Yn+p+1 
PETS аты P Teck s T Уларзіж (1,18). 


If we couple this last observation with (4-9), in which n is to be replaced by n +r +t, the? 
we can speedily deduce from (4-13) the representation 


ИЕН 4 


Yn +1 Yn +1 Yn+p+1 
dea уны Те Ка а 


where „ О is to represent the L.T. of a function of B. V. Ви&Гїза typical term in the expres | 
sion for (— 1)? 2), and hence we can infer that 


Yn+p+1 , Ynsp4i Yn+p+1 4-14) 
(= P Уа) = тту ag PH ESO, ( 
where „О. continues to represent the L.T. of some (different) function of B.V. 


ient? 
At this stage we invoke the preliminary result (4-10) to infer that many of the coefficien 
in the expansion (4:14) must vanish identically, and that 


(1p фа) = 7а +... ан ‚о, 


өтері” gntp 5777 врт 
Q(t 45) 
Thus Palt) = y UE ys +... Нуна +90. s 


Moreover, since Q(t) is B.V., it tends to a limit О( +00) as t -> +00. Thus s, O(s) > О( ar 
ав 8->0-.. But we must have s,,Q(s) = o(1), for (4-14) to agree with (4-10). Hen á 
О(--оо) = 0; and since the remainder term ЄР Q(t) must be bounded (by Theorem 1), we ә 
rewrite this remainder term as А@)/(1 + 0), where A(t) is of B.V. and vanishes at + 00: d 
We shall next prove: (a) that A(t) satisfies (*) and hence belongs to 2; (b) that A(t) helar 
to Z, and hence to 2. It will plainly be enough to prove (a) and (b) for Q(t) instead of Al à 
Let us, temporarily, assume that we are given only that д, < co and not the full inform 
tion fon 44, < оо. Then the argument that led to (4-14) is still possible, but we can no 10006 


18 
assume © to be B.V. However, reference to the argument following (4-13) shows that «2 
the sum of terms of the forms 


(i) Yn+p+1 x Gp] (p < n+r-+t), 
| (4) „ОМ (rete p). 
АП terms of type (i) are L.T.’s of functions of B.V. and must automatically satisfy (27): BY 
Lemma 11 (ii) the terms of type (ii) are also L.T.’s of functions which satisfy (**). Thu? 
satisfies (**). In other words we have shown that if Баараа < © then we may write 
A(t) 


16) 
Palt) а (4 
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where A(t) satisfies (**). If we compare (4-15) with (4-16) we deduce that Q(t) = A(t)/t 


He 
= Ааа 
; АЯ 


Q()- Q(—2) = а-а), 


and the demonstration that Q(t) satisfies (*) f 
a 2 ollows from the facts that A(t) satisfies (** 
that Q(t) vanishes at +00. This proves (а). реали 
To prove (b) let us suppose temporarily that we are given that и, руз < оо. Consider 
+ H H . q 1 H ор : 
terms of type (i). The distribution function G involved in a typical term was shown above to 
have its first n+r-+t moments finite, when 4,,54, < ©. We are now assuming that 
Кърал < 9, and so we can take it that G has its first n +r +t О 
: + 1 moments finite. Hence, th 
P : ` d 8 
first moment of Gip will be finite, and representable as Ул рә. Ву Lemma 4 we can therefore 


infer that terms of type (i) can always be written as 
Ynt 
кыш +Yn+p+2 *Ө+п(8). (4-17) 


In addition, terms of type (ii) can, by Lemma 10 and the finiteness of е, be written as 


ж л 
Улары = G(s) +*G(s)*M2}2;1(8), 


and thence as Tees ++ *бш(в) + *@(в) 35515. (5). (4:18) 


If we combine the new expressions (4-17) and (4:18) of all terms of types (i) and (ii) then it 


is evident that „О can be expressed in the form 


утыз a (в), 


. V. But a (necessary and) sufficient condition for a function 
T. be recognizable as the L.-S.T. of a function of B.V. 
vanishes at +00. Thus Q(t) belongs to Г. What 
e stated as follows: if j5,,,44 < oo then 
П(4) 
Galt) = vi a IY ры 


(4:19) with (4-15) we deduce that I(t) = Q(t)/t. From this 
$. This brings to an end the proof of Theorem 2, 
polynomial approximations to the j-moments, and are 
to the study of the V-cumulants. Before proceeding, 
about ¢,,(t) if ача = ©. In particular, suppose we are 
as in the proof of Theorem 1, we can at least get as 


Where E(/) is some funotion of B 
to belong to L,(0,co) is that its L. 
Hence Q(t) – Ул+р+? belongs to Гл. But Q(t) 
we have hereby succeeded in proving may now b 
(419) 


Where П(ё) е Та. If we compare 
equation the proof of (0) is apparen 

We have now proved the required 
ready to turn, in the next section, 
however, let us see what can be said 


only given that дал < 09,4 € ?* Then, = 
аг аз " 
Yı Y2 Ya ay + М", 
*ф.(8) = gut gi eise tont а-а 4 
emma 12. 


Тһе following is then an easy deduction from L 


Conorrany 1. If аа < © 97 0, then 


à. (D = + y, FON у" tt (1+#)"-2A(d), 
where A(t) eB. 
Our results would have greater 8 


but we have been unable to prove % 


ere able to replace 2 by 2 in Corollary 1, 


ymmetry if we w 
ment is allowable. 


hat this replace’ 
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5. ASYMPTOTIC BEHAVIOUR OF THE 1/-CUMULANTS 


THEOREM 3. ЈР 4з < ©, p > 0, then 
Vall) = аа 70 
n Уа Yn+1 (1 +02’ 
where A(t) e 2. 


Proof. i, (t) is a linear combination, with numerical weights, of a finite number of terms 


like 
2.00 дь)... Pp (t), 
where p, *- pa ^- ...- p, = n. By Theorem 2 we have 
А, (t) 
nj 


$») тесу a ы +. + Урна (1 ES {)"+ю-Р) á 


where А, (f) e 2. Hence 


$5, (t) Фр. ( ... 9,00) = yt + yst"71 +... + Yn + RU, 
where R(t) is the sum of a finite number of terms of the form 


Àn (t) A(t) A, (t) 
ПМ - Se З NES 
€" (110) * (p pee Y n QU pui 


with m+l,+1,+...+1, < n. By property (C5) of the class 2, this form сап be rewritten 


к). „ГЕ 
(14-0)? UET > 


where Alt) e 2. But we must have sn + (s— 1)р-ХІ; > т, so the function in the braces m 
clearly in the class Z. A second appeal to (C5) allows us to write 


R(t) = ———— 
for some A(t)e 2. Combining all such remainder terms we discover that 


V.) = TELE Era (Ac 2). (51) 

The theorem will be proved if we can show that the coefficients Y» Yo se Ул ліп the right" 
handsideof (5-1) all vanish. These coefficients depend on the first n — 1 неса ЖОЮЫ йт 
of F(z) and on no other moments, It might be possible to follow through the algebra of their 
derivation and then to show that they automatically vanish, but the algebra involved seen 
prohibitive (although we return to this topic in the appendix). For the present proof La 
adopt a different attack. 

Let Fn be the space of all distribution functions of non-negative random variable? 


whose first, (1 —1) moments are finite, and let -Z, , be the Euclidean space of Qu 
dimensions. Then we can map ¥,_, into Z, by letting Fe F map into the point 
b= (и ts; 13/4) Whose co-ordinates are the first (n—1) tameii 4 the correct order: А 
Е. Call this mapping T, so that Ta- F = п. Then we wish to show that the rational funt. 
tions y, y, ..., y, of (His Hos ..., 1) vanish throughout T, 14, 4. It will be sufficient 
jpe nea find an (n— 1)-sphere 5, ус n-14 n- and show thet ха rational functio” 
vanish throughout S, ,. Let Я „1 be the class of F(a) which can be written 


1-- 


1 
(ay tet Беата) (ә > 0), 
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where ру, vs, р, з are real and positive. We show below that T^, ; £, | contains an 80, р. 


Thus it will, in fact, be sufficient to prove that уу. Уз, ..., Y, , vanish whenever Гв”, 1. 


at T, , 4, , contains an 5, үу we remark that if Fe £, , then the {x,} are 


To prove th A 


given as elementary functions of Ру, 2%, -+s v, 3: 


r! 
„== к” р (Ғ--1,2,...). 
It is simple to show that there is a point (Vi, Vos ..., v 4) in a spherical neighbourhood of 
which the Jacobian (ра: Hox sse) 


BP, Pay es Раа) 


does not vanish. Thus there is a point (И, б, «+s Ил), а Spherical neighbourhood of which 


isin 710,1. 
Suppose Ғв%, , and generates a renewal process Ху, Xo, ..., in the usual way. Let us 
suppose v, < v; all i, and write Р) = 1—e77^. 


Then we can uniquely determine a renewal process Xf, Хх}. .. by demanding Р(Х) = F(X,), 
all i. By the construction of F? we see that Xf < X, all i and hence, if we employ obvious 
extensions of our notation, Nf > N, This last inequality implies $1(@) > Q, (t) for all n, and 
ng function Ф(0) is undoubtedly analytic in any circle C, = (£:|£| < є) 
is analytic. But a routine caleulation shows that 
ФО = exp {& (1 D 
" (1—5) 


Thus Ф) is analytic in Ол. We use this fact presently. 


so the generatir 
within which Ф 6) 


which is analytic in бі. 
If Ped, then 1 "3 1 | 
" “Қа; ly» (5:2) 
en it is evident from (5-2) that * F'(g) is real and strictly 
аһ сє апат + 0then |*F(s)| < *F(c). Thus the (%— 1) 
i = 0 and (n — 2) zeros in the open half-plane 

ze * - t consist of опе zero at 8 
c M иа all these (m— 1) zeros must be located at regular points of *F(s), 

. Furth у 


although some may be locate d at zeros of *F'(s) so that the possibility of multiple zeros 
1 а 


cannot be ruled out. Now consider the equation 


Let s = сіт with о, т real. Th 
decreasing in J = |0, +00), and th 


sp) - 1-6 (ч) 

some small neighbourhood, С,, of the origin. Then it 
of this paragraph, and from the theory of implicit 

87) that (5:3) һав (n— 1) roots (©), z (E), s Zp—a(€) 
Which are continuous functions of ¢. With no loss = Е Keen xh n - 0. 
Then, if ¢, д are sufficiently small, we shall have A ааа M inus 9 cis e 
half-plane o < —28, |4()| < 8, whenever үк Moy (5) iid (6) alwa; Еріне di i d 
Eos t ient ір > on iu 06) | Р), раса н ene of degree 
(1.— 2), P(s) a polynomial of degree (n— 1). This is legitimate, OF к. 
„то = | ete 


0 


Where £ is complex, and confined to 
follows from the previous discussion 
functions (Goursat (1933), vol. п, P- 2 


Biom. 46 
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then Lemma 6 leads to the conclusion 
DO = -g-ra 
(1-6) P(s) - Q(s)' 


Р ік 5 d b 
Apart from the factor s, the polynomial in the denominator of (5:4) evidently vanishes 8 


2(0),2(О),.->2,-1(0). Bearing in mind that z,(0) = 0, some elementary manipulation shows 
that (5:4) may be replaced by 


_1 
8 


TI (s—2,(0)] 


ое (#0 
ih і8-2/01 

Thus we may adopt a partial fraction expansion 

NE (50) 
«040 = 3 ES 
Ee TE (0 (0) on 
265) = 23 (j21,2,...,n—1). 
I [z;(£) -2(401 


ij 

Since, for 6 € А,, the z;(£) are distinct-valued continuous functions of & it follows from p 
that the Z;(£) are finite-valued, continuous functions of Се A,. Thus we can find а Me 
М < co such that |z(£)| < М for all j, uniformly in £e А,. Moreover, it is apparent eii 
Z,(¢) cannot vanish in A, for this would require that one of the factors in the ape (0 
(5-7) vanishes in A,. But if £e A, then |z,(£)| < 6, and |2(0)| > 28 for all j + 1. Thus 1 
cannot vanish in А, and so we can find a bound m > 0 such that |Z,(£)| > m for all бе 

We next invert the L.T. (5:6) and obtain 


же ы (5:8) 
Ф(0 = > (©) e^o. 


itten 
If we suppose є A, and use the results of the preceding paragraph then (5-8) can be rewrit | 
_ OE) = Z,(C) e 8--8,(6 t)}, 
t) = 0(e~) uniformly in Ё. If we take logarithms we have 


(59) 
ЖО -1г(0--062,(0--8,(6,0), Р 


where S,(£, t) is also 0(е-%) uniformly in £e A,. 
We have shown earlier that Ф 
,(€) that ,(0) 


where S,(£, 


” a 
(5) is analytic in C, and it is plain from the definitio? 
= 1. Thus, in some C., YY(£) is an analytic function of €. Hence 


! Ж 
Val) = zb. Tar, 


of 


where С” = Cs. If we now make use of (5-9) we can obtain the result 


40 
Vaf) = ost f, + Ofe), (57 
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n! 
where an, = 25 À жо, (бал) 


21 р 10521) 
Әл? с’ pee 


п = 


4. (5-19) 


А comparison of (5-1) and (5-10) shows that the rational functions y, .... Yn-1 
undoubtedly vanish when F e €, ,. This proves the theorem. 


THEOREM 4. Грина < oo then 
Yat) = Ynt Yu А0), 


where (Е) e 2. 
Proof. This theorem is proved by the same argument as Theorem 4, with the difference 


that we must now appeal to Corollary 1 to deal with $, (t). 

One can also deduce from Corollary 1 the following 

COROLLARY 2. Ги» < oo then 

y(t) = Ynt + (1+0, 
where A(t) Z. 
6. THE FORMAL CALCULATIONS 

at under certain conditions V; (i) = c, t + В, +, (t), 
сә Bn is a rational function of j4, з, ..., Шар and 
We now turn our attention to the formal 


Our position is that we have shown th 
where a, is a rational function of fy, fla» - 
r(t) is a remainder term tending to zero at +00. 


calculation of the rational functions a (Jt, 42» се Hn) Bats Ios о Илла). 
To justify the method we adopt. consider the problem of calculating £, (4; Jt», ..., т) 


defined on Y,,,. If two rational functions on Gra are identically equal throughout an 
(п--1)-врһеге 5,3) € +1 then they must be identically equal throughout Lai. But 
T, i4 9 „1 contains an Sinn: Thus we may suppose, in the calculation of Pg раны Lm 
that е, ул. In fact we shall proceed as though F is an arbitrary member of @ = U, €, ; if 
we calculate а, and f, accordingly then our results must be correct for all values of n. | 
If Pe then: (i) *F(s) is analytic іп some open. neighbourhood of s — 0; (ii) there is 
a unique analytic function 21(8) defined on this neighbourhood, vanishing at s = 0, and 
satisfying the relation 8-1 — *Fr(z,(5)). This all flows from the discussion in $5. Moreover, 


by Cauchy's integral and (5:11) we can deduce that if |s| < łe then 
г 204 


als) “эў 
* 1 EAC) © s" dt, 
, "3499 t (807 
e $02 a . 
XA" (6-1) 


the change of position of the integration and summation 
lating the (a) is to expand z,(s) as 


Uniform ЧӨЙ 
convergence justifies 
i ts that one way of calcu 


Signs, The equation (6:1) sugges 
a Taylor series at the origin. But we have | 
z (= 1)"* ln n 

ges pp ХО) 


n=l 
2-2 
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by the definition of z,(s); in other words we have s аз a Taylor series іп 21.1 Our problem is 
therefore merely one of inverting a series. This process has been discussed by many authors, 
and the fullest computations appear to be those given by Van Orstrand (1910), but one may 
also refer to Adams & Hippisley(1922), Dwight (1957), and Goursat (1933, vol. т, p. 474). 
ТЕ we use Van Orstrand's results we can obtain the values of 2, listed below. In point of fact 
our values of c, were calculated independently, by a method which we describe in 87 and 
depends on the correctness of the calculations for //,, and then compared with Van Orstrand’s 


results. This procedure provided an additional check on our 0, calculations. In quoting our 
results we take и; = 1. We found: 


% = fla, 
Og = — Jig + 3/3, 
o, = fly — Opto pta + 158, 


Os = — Hs + 15 fly ta +1043 — 1054 из + LOS ped, 
He — 21055 — 85 fly pa + 21002 и, + 280 py 0$ — 196058 из + 94505, 
— Hy  28j 4 56151 — 378 из + 30] — 1260 fly + 31504344 — 2808 + 6300/68 
— 17,3253 из + 10,3955, 
ds = [tg BG flay — Bpis + 6304 ig — 126], + 2520], su — 6930 н + 1575/4 
+ 210053 и: — 34,6505 l H4 + 51,975] и. — 15,400] 43 + 138, 600/08 03 
— 270,270 8 из + 135,1354}. 
Let us now turn to the more involved problem of calculating /,,. Lemma 4 tells us there 18 
а distribution function Бу such that 
1 *F(s) 
Has 


and if, for typographical simplicity, we write T4, т», ... for the moments of Fp, then 


* Fas) = 


, 


«рв = X, 080, (6-2) 


r=0 r! 
By Lemma 6 and (6-2) we have therefore that 
жфы(5) ЖС! T» 
es 


=n 
Se ШЕ 22 50-. 
at 78518 а Р 


and the coefficient of s? in the Laurent expansion of the right-hand side of this equation is the 
coefficient of 8" in 


1 Таз CER 2 т 2 n(n- 1) (n-- 2) T * ревю. 
epe gi? +.) x а die аш: um e ЧА 
which is evidently 
= CT mal d Thi Тиз... The 3) 
n= (-D* E (C Dn(n- 0... (n&d-1)z* E. aid a 
4-1 Pi Pe! ... P Mm.) (т)... (m)? 
where the inner Х* is a summation over all positive integers л, » auch 


TEN My} Po Po 09 В 
that the 7; are all different, Урл = n and Ep =й. Та Пу; Pas Рз 


T For a heuristic discussion which shows the present significance of the root 2,(s), see Bartlett ( 1949): 
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Mai -mahe tuis oe ЭШЕ чна тт 
M ni ome об, fas er than any power oth, Let us agree to mean by 
nstant term in Q(t)’ the coefficient of t? in P(t) and by the ‘linear coefficient in Q(t)’ 
et us mean the coefficient of т P(t). When F 4 all the moments of F are finite and we can 
therefore speak of the constant term in ф „(6), by Theorem 2. But the constant term in ¢,,(¢) 
must be the coefficient of s? in the Laurent expansion of *¢,(s). Thus the constant term in 
$,(t)/n! is с, given in (6-3). 

It is trivial to verify that the constant term in, say, plt) Galt) is merely the product of the 
constant term in à, (f) and the constant term in ¢,(¢), and it is equally trivial to extend this 
iw to products of several ¢,,’s and thence to linear combinations of products of several 
%,/8. But, by the well-known relation between moments and cumulants (Kendall & Stuart 


(1958)), y, 4" - vis mgle... of 
eo 57 — 1\4— © уз 1 2 Ty : 
= Eugen EIU die RIF (6-4) 


and so we deduce that д, the constant term in y/,,(t), is given by 


ghi с?з... gr 
WP т. (6-5) 
pi Pat ++ Ру 

z: weight and degree. For instance, the 


p а-а ES 
TEES [nans 
1 а= 

We shall make use of the customary terminology 
T-product 777,73 is of degree 2+ 1+3 = бапа weight 2x 1+1x 343x5 = 20. Itis to be 
noticed from (6-3) that c, is a certain linear combination of all 7-products of weight n. 
Similarly, from (6-5), we see that 2), is a certain linear combination of all o-products of 
Weight n. Hence В, is a certain linear combination of all 7-products of weight т, and our 

t . H 

method of computing 7, will involve taking each 7-product of weight т, in turn, and 
determining its coefficient in p. (It is worth remarking that this is another place where it 
has been to our advantage to be dealing with y-cumulants instead of conventional ones. As 
can be seen, now, from (2-6) the constant term b, in the conventional cumulant £, (/) involves 


all r-products of weight < л.) 

To be specific for a moment, consider 
Can түт? arise from (6-5)! In various Ways: "7. 
0104, F203, or 0:08. Thus, if we represent 7172 
Various ways in which 7,73 will arise in (6:5) correspond t 
Ж (2°), (2) (1, 2), (1) (2)*. For each separation we must calculate 6 
to the coefficient of 747} in Js- 


Suppose we now consider a general separ 
Il = Su Su --- S88 + 


a term 7,73 of weight 5 and so occurring in 0. How 
We may obtain түт} in the expansions of 6, 
by the multipartite number (1, 2?) the 
o the various separations (1, 2°), 
he individual contribution 


ation 
qp. Sp 
., Sia are а different multipartite 


n this $5, Өле” А 
tite numbers of weight 


So, are b different multipar 
4 орто Where 7 = Ratha ese ЕТ 
f ways in which the r-product T corre- 
which is involved is 


E Some general multipartite number 5. I 
à Umbers of the same weight; 81, 82 "72 4 
etiandsoon. Then theweightof is" = шу 0272 
" ©. This general separation represents one set 0 жүре 
Ponding to could arise іп (6:5). In fact, the term 1 

од, өт 06 194) 

If we write 9 ұғы + 7% then it is evident that the coefficient of (6:6) in (6:5) is 
-= fitrat tM 


] IT "mer 


"i Tal TEE 
| Үлы 


p eI: мен 


Есе = e 
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Now consider the term от, which must give rise to a 7-product corresponding to the separa- 
tion 571 Sig... 512. By the multinomial theorem this 7-product can arise іп 


! 
D M 


ӘТЕР БЕЛЕ 


ways. Similarly for the 7-products arising from 015, -.., 015. Thus the general separation II 
arises from the term (6-6) in 


| -— (6:8) 
Туу! Tyo! „+ а! z 


ways. If we combine the coefficient (6-7), which must multiply every -product from (6:6), 
with the number of ways (6-8) in which T may arise from (6-6) then we derive the form factor 


rU (6:9) 


Notice that the form factor (6-9) depends only on the general ‘shape’ of the separation П 
and not on the weights of the individual separates. Thus, for example, (1, 22, 3) (2)? (4) and 
(1) (3)? (7) have the same ‘shape’ which we will denote (+) (-) (-)?. We can thus construcb 
a table of form factors which may be used for a variety of different separations. 

The form factor (6-9) is merely the number of ways in which 77 can arise from (6:6); 
multiplied by the number (6-7) which must be a factor of every contribution from (6:6), 50 
the final coefficient in /,,. Each separate S;; in II will also contribute a numerical factor which 
we now deduce from (6:3). Suppose Si = (nf mt ... mp), corresponding to the T-product 


қ Ty = тата... трі 
мі 
Wi = Pani + рт +... + PM, 


dij = Pit po t ... - р. 
Then the coefficient of Тіп (6:3) is evidently 


(Пе ww; +1)... (w; 4d; — 1) (6-10) 
Pal pg! ... py! (пир... (т) i 
Let us write 04 = MET ert а) 
Pi! Po! «+ py! 
for the separate factor of S;;. Then the numerical factor (6-10) may be rewritten 
ді; бу 11) 
(Нн аңы. ЛДЕ” и 
Suppose now that T eant... 


is of weight w and degree 4. Then when we multiply the numerical factors (6:11) togethe?: 
one for each separate Sij, we plainly must end with 


( = yer 
E Y^ (из... (и! | m id 
Intheresultant numerical multiplier (6-12) it may be seen that the factor in braces depend? 
only on Т and not on the particular separation II involved. We may call this a r-factor- 
Thus to determine the coefficient of T in Ви we first determine ^ T-factor. Then repro" 
sent 7 by the appropriate multipartite number S and write down every possible separatio” 


(6:12) 
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of S. For each separation we multiply together the separate factors of each separate, and 
multiply the product by the form factor of the separation. The resulting products for each 
Separation of $ are then summed, and the total multiplied by the 7-factor of 7'. The result is 
the coefficient of T in f,,/w!. To convert the result to an expression involving js instead of 
T's we use the result 


Example. Find the coefficient of 7,73 in 0/51. 
. _ \з+з 
(1) The r-factor is evidently a =}. 
(ii) S = (1,22) and the possible separations of S are 


We need the following short tables. 


Separate Factor 
(1) 1 
(2) 2 
(1, 2) 2 
92) 10 
(1, 2%) 105 


Using the tabulated separate and form factors we can compute the required coefficient as 


follows, 
aration factor Product 
Separation Form factor | бер 
1 105 4-105 
x : 1x10 — 10 
ü a 2x2 = 94 
2 1 1 

T n 1x2 P й 

Пп, +1 
Total 4 75 


ы >. 2 H 
If we multiply the total by the r-factor } we find pA ee involves a term 757,73/4. It is 
; е: 
an вазу deduction from this result that fs involves а 


75 x 8! 9513, 


туза = 
(we h 
ave taken 4 = 1). 
9r this example we have purposely cp 
come increasingly tedious as we deal with В 


3 а ши 
9 guarded against is that some separations of 


a fairly simple case, but the calculations 
higher weights. The chief source of error to 
Itipartite number may be overlooked. 
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For instance, the multipartite number (15, 2) referring to the term /3/z, in 2; has 19 different 
separations. 


By the method outlined above we have computed formulae for 23, Aa» ..., Ag: We now list 
our results, taking у = 1. 


А, = ite, 

В» = — $us T $408, 

Bs = $a — Shells Sg, 

Ba = = + Hoha eM — 44и fg + 2, 


Ps = 06 — 1Зизнь— 203, + 3553 и + 125] — 465/6 H3 +8323, 


S 
ll 


— Fy Ss ps Apps — 1958 из + +88 — 605p pts + 12900 p, — p 
+ 2465/2 u$ — 579003 из + (11,895/4) иб, 
By = iig — 25555; — Тиз He + 35008 tg — TT Jg + 199506 fl g fly — 3108/8 s + FF peo 1 
+ 1015/5, — 14,56002 изи, + (70,335]4) pd i, — 623095 8 + 48, 860/03 15 — 33,2658 Из 
+ (72,855/2) ui, 


Ba = — Зо + SEus tg + 80 pty ji, — 584%; + 133/4 Ho — 25200 fly flg + 6713/3/6 +3921} 
3620, jus — 235Зу н, + 37,4644 лд, — 54, VIS — 2870), + (01,315]4) а 
+ 59,010 /15/8 pt4 — 339,990] зуи, + (034,935/2) д + (19,040/3) 0 — 218,960 


+ 1,019,970//2 из — 1,359,960/08 из + (8,294,395/16) 3. 


To conclude this section let us remark, what should be fairly obvious, that one can pt 
the {а„} and {bn} of Theorem 0, which refers to conventional cumulants, from the {on} anc 
{2n} by means of the relations (2-6) and (2-7). 


7. CHECKS ON THE FORMAL CALCULATIONS 


We begin by describing the way in which we obtained the {о,} from the (/,) calculation? 
This method is not the best way of determining the (z,) directly; that developed in $6, using 
Van Orstrand’s results, is undoubtedly better. But if one has already performed the calcular 
tions of (//,) then the present method of deriving the {о,) is fairly easy. By comparing thes? 
results with those obtainable by the method of § 6 we have a good check that no separation? 
have been overlooked in the {Zn} computations. 

Let о, be the linear coefficient in ¢,(t)/n!. Then w, will be the coefficient of 8! in ps 
Laurent expansion of %ф,(8)/ш. By the same expansion which preceded (6:3) we hav? 
therefore that 

Q4 = 1, 


d=n 1 тра v "7 
а-(-і У 1 (n4 1) (n2)... (nd) ж 75 784 TH (rn 
daa | 


Thus o, ., is а linear combination of all T-products of weight n. Notice, furthermore, that а? 
individual r-product of degree d in (7-1) has the same coefficient as it — have in 6%» the 
constant term of ¢,,(t)/n!, except for being multiplied by a ‘correction factor’ (n + d)[n- 

It will be easier to understand the argument that follows if we talk about à speci m 
T-product, so let us consider 7,72 (which occurs in z,/6!). Consider, in particular, the sepa?" 
tion (1, 2) (2) of the associated multipartite number (1,22). Corresponding to (1, 2) (2) nd 
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m ннн : те in which т, 73 can arise in (6-4) (with т = 6) in the linear coefficient 

(i) 7, 7, from o, in ф,(0)/4. and ть from e; in ġa(t)/2!; 

(ii) тут, from сз in ф,(0/3!. and т, from ws in $,(t)/3!; 
ó riy тут, from oy in ф,(0/3!, т. from оз in ¢,(t)/2!, and the linear coefficient w, = 1 from 
"NOS 

Let 2 be the coefficient of 7, 73 in 0/6! and € the contribution of the separation (1, 2) (2) 
towards the coefficient of 7,73 in 25/5!, as described in $6. Then a careful investigation of 
(6-4) shows that (i) contributes an amount (1+ 2) € towards 2. The factor (1 +2) arises as 
the ‘correction factor’ of (1,2). Similarly (ii) contributes an amount (1 + 3) Є towards 2. 

Suppose A is a given ‘shape’ of p parts (e.g. (-) C9 C and () C) C) C) С) are ‘shapes’ 
Of six parts). Let ^ have a form factor f. Then the augmented ‘shape’ (- )А has form factor 
— pf. This contention is easily checked by (6:9). 

Now consider the contribution from (iii) towards 2. Since both 7,7, and Ta are here 
behaving as from constant terms (078) no correction factor is required for them. But @ is 


based on a form factor of (-) C); whereas what is required for (iii) is a form factor of (-) (+) (+). 
By the preceding paragraph we see that (iii) contributes an amount —2@ to Q. Hence the 
total contribution to 2 by the separation (1,2) (2) is $-- €. 

an indication of the general result which we now describe, 


` 
The preceding discussion gives 
and which can be proved by а careful extension of the argument. Let 7 be a r-product of 


9 ri Sr: Tk Ч 
Weight w, and associated multipartite number S. Let 511952... ӨР bea separation of S, where 


the separate 5, has weight w; and degree d;. Let € be the contribution from the separation 
б5р... SE to the сое аїані of T in f, [w!. Then the contribution of this separation to the 
Sefficient of Тіп a, ,,/(e 4- D)! is 

ndi 4 fads ы 

w, We 

1 the (25) rapidly from the (,,) calculations. 
61. We borrow the results of the example 
]umn А is the extreme right-hand 
$6; in other words it consists 


in the example in § 
5!, but with the r-factor left out. 


ridi 4, (7-2) 
Wh 
It is th: А 
P this result that enables us to obtatt | 
“тат Я қ f 7,73 іп % 

ple. Find the coefficient 0! 71 ^m below, со 


Wo ; . giv 
tked in 56. In the computation scheme g! 
displayed 


Columr 
1 of the computation scheme =. 
9 I д 1 T51n P 
арлы various contributions to the coefficient o 1 ы Я * т oma 
1e column B b ^» cive the approPr 
elow, we give 
E тент B Product AB 
A 
Se ti | В | 
Separation | um 
 — — | * 
з + 
2 +105 2 245 
n | =й н e 
(1) (2?) a $ E P 
(1,2) (2) ae 1+? 
(1) (2)? ИР nn 
HERMANN 
Total +28 
Ен ды 
Th 
e - + 
: T-factor, as before, is $. mis ше 
US the coefficient of 7, 72 In a, 6: 18 
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Converting to //'s instead of 7's, and taking jj, = 1, we see that 2% contains a term 


7х6! пн 
оза = 280 38. 
This result will be found to agree with the value of о, given in 56. e 
The most obvious check to employ is obtained by putting F(x) = 1 — e77, so that и, к 
for all л. Routine computation shows that in this case Walt) = (n!)t+(n—1)!; thus 


i =(n—1)!. 
substitution д, = n! in our formulae for о, and 2, should give «, = n! and fj, = (n—1) 
We call this the Poisson check and all our formulae pass it. 


We now derive a useful and wider set of checks. By (5-4) it is evident that 


1 


= = 
РО, 
© б, £n [e ds _ (7-3) 
50 that 2 mI Lae cS(I-*FG)-Q' 
Where C is some circle |s| = à in the s-plane. 
Let us suppose, that for some positive integer p, 
E = =9 —8)Р 4 
*F(s) = 144.2 1) a. (b е lia. ul 9T (1 ) 


p 


А Г А its 
It is true that there is no distribution function F(x) with the above expression for ‘a 
L.-S.T., but we are interested only in formal relations between coefficients so this fact ne 
not trouble us. From (7-4) we see that 


1-ap( = (81 


e 
and substitution in (7-3) yields 
ЭЁ 1 2 ds (9 
o n Әт oS{(1—s)?—1 +4 pz 


For é arbitrarily small the contour integral ( 

ats = 1—(1— pt)", where the pth root is m 

A straightforward evaluation of (7-5) 
© Cy, En 


с 
ом ру) (pga 


7-5) has two simple poles within C, ats = 0 an 
eant to be the branch taking the value 1 at ¢ = à 
by the usual methods of contour integration then giv® 


Thus мас = 6 c, 


—1 
т o nl | = 1ор&—1ов{1—(1 0019) gn — pt). 
If we differentiate the last equation with respect to 6 we obtain the more manageable 
result 
ў баб" 21 @-рб-Ф-шр (p—1) 
1 (m—1)! | 


1 
E I= (pee + Te 
3 
6 


1 
= TPE) -de 4 (1 — ру—р—зур р +(1—pt)-e} 4 = 1) (7-6) 
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r i В ӨРҮ 5 В ы ж 
The right-hand side of (7-6) isin a form suitable for expansion as a power series in 5, and on 
comparing coefficients of 27-1 one deduces that 


А. (3 -pgi- S r+ (r+ 2p) ... (rn — 1p). (7-7) 
СЕЗУ ы тр r=1 


Referring back to (7-4) it appears therefore that our formulae for В, should give the values 
indieated by (7-7) when we substitute 
wel №=(@-1, №в=@- dis m 
sfp = (p-1)5 А = 0 (n>). 


Evidently, by (7:4) we have 


— 1); —1)(2p-1 
a ip o аа 


If we pick out the coefficient of Ё" in this expansion we can deduce that 


а, = (p—1) (2p — 1). (n71p- D. (7:9) 


Thus the substitution (7:8) should also give (7:9). 
We call the checks just described the p-check and all our quoted formulae have passed 

p-checks for 2< p< 8. We presume therefore that they are perfectly us e түт 
Ап advantage of the p-check in the computation of the {2n} Mira ist m e ise dus 

of the z, for n > p enables one to isolate inevitable errors to a few terms. This makes the 


Search for errors easier. 


APPENDIX 


A combinatorial identity 
i i iali i that а multipartite 

From T iti i deduce а curious combinatorial identity. Let us say 
iban d E Ж рор ie allowable ifits degree d is not less than its weight w (oneof the (pj) may be 
zero), If S is allowable өп associate with it a coefficient 

d! 
A(S) = |422 (ра)... (Рк) 
(d—w)! (Pr!) 

y separate 5:18 allowable. To an allowableseparation 


A separation St St ... Str of SS is allowable if ever 


®ssociate a coefficient (- D9 DHAS) --- [A05,)]^. 
BSP бр... SH) = gal Gal Im! 
wi 
RIO g = qi did — 
en the identity in question 18 tha - СЕН Бе) = 0, (A.1) 


ons of an allowable multipartite number 5, of 


rati 
Whenever the summation Х is over all allowable sep? 


“Bree d, weight w, with d > w+ 1- 
» y choosing S in special ways alarge nu 
Shall not pursue this topic. If we were 


" í есіні cases may be deduced from (A. 1) but 
xd e eite by elementary argument we would bein 
able to 

iversi f Cambridge, for the 
T boratory, Univ ersity 0 n ge, 
ТА am dudas to o D, di E of the Statistical La А Incidentally, I am also indebted to him 


Caloulati of all 5 :artite number have been omitted 

tio. erformance : ven multipartite п ias 

for Bots = of n fe and e Les HOC separations of а Е separations of а given multipartite number 
g an additional ега be checked algebraically, for the 


rm factors summed 0 1). This can 


© working. For the fo 


m à the form (п : ... t£" in the logarithmi 
totes to zero, except when the yore 3 пр») is the coefficient of af oe í ОЯ 
involved with а number (7 t, pera 


Pansion of {(1—2) (1—8) = (1-2) 
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i As it is, we deduce 
ition to remove the complex-variable argument needed in the proof of Theorem 3. Аз it is, wec 
а position 
(А. 1) from Theorem 3. 


To prove (A.1) suppose that 8*Ó,(s) = eg tes + es? +... 
за Pals) ші у Ав", (А. 2) 
'Then di E 
24 ру 
here Ad = уж Em Em Ore 
wher 


(т!) ...(лт„!)р›” 
and >* has the usual meaning. 


If we invert equation (А. 2) we have 


Фай) 54, quee 


" i (А.З) 
а tk 


Қ е А.З) 
where we may assume (by Theorem 2) that the remainder term 0,(t) is negligible. W MO mE then 
that the coefficient of t" in Фа is a linear combination of e-products of degree d and weigh . 
follows easily that the same remark must also apply to Yalt). . «one ten 

Consider a particular e-product E = eg»ef... efm, of weight wand degree d, with T; = '( 85 number 
Some e; to be absent). Suppose, moreover, that d > w+ 1, so that the associated mu "om (у mus 
S = (0% 19+... тт) is of the type required for (A. 1) to be true. Then the coefficient of t : S ие nus? 
contain some multiple of E. But Since d—w > 1, Theorem 3 insists that the coefficient « 


Dt а какаа » proof 9 
vanish identically. Hence the multiplier of 2 must be identically zero. This is, essentially, the | 


рақ. о. 
5 5 de: ines z why this i8 8 

(А. 1). We must now effect ап alternative evaluation of the multiplier, however, to show why 
Let 


П = SYS... Ste Std +o SHE... SRT. STH m 
2 а 
be a separation of S, with S of weight w,; and degree 4, for allj. This separation p in 
torization of E into Еу Ej ... EU, where the e-product Е, is that associated with Si. м саге e E peat i 
the way E can appear in y(t). Ву (А.З), assuming o,(t) negligible, it is clear that E сапа | a is then 
Palt) unless буу is allowable. Thus we must suppose II to be an allowable separation of 5, anc 
apparent that the factorization Бү... E?! must emerge, іп y(t), from the term 


7 (q— 1)! df dz Я (А. 4) 
where 


li: = У,4, and q= Eq. 
We can obtain Efe Еу... Eis from Фа in 


т! 


Gi! de! ... Фа! 

Ways, each time with a coefficient which is, by (A. 3), simply 
[A(5,,)]: [A (Sia)... [A QS, ,) Jue, from 
pply to the other batches of factors of Е of equal degree. We can then deduce T is 
ntribution to the coefficient of E, in у 4(t)/d! which corresponds to the separation 
—1)"(¢-1)![A(S,,)]™ ... [A 1S.) 
(—1)*(q-1) L ( nel] [4(5,4)] = (п). 

Чи: i2 +++ Фа! 


tive evaluation of the multiplier of E in Vra(t)/d ! is simply X 


Similar remarks a 
(A. 4) that the со 


Thus our alterna; 
(A. 1) is proved. 


denti 
* B(IT), and the ident 
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ЕЕН 


&. steps ы + әне duntmasnsl канаа ўа ure positive and Scctir sands in iu d 
& бы: “Ж; 4 Ion PEL яғына ите інен iod: пе нна phe im IÉ sig Баа Ја 
ен. is 
Mist ia FOCI) С О dato tee Зари du өтө qu er "mad гре 
Әшір. : е ете мес ри тесер E ^ ie A-d- : | 


tof 
өза Rei қана ike «бе: B роне ығы варень фа tH 
646 ÎN В the Байны, and the ван ака walle неневва eon eerie at a eonatant HLB: 


1. INTRODUCTION 


The problem considered in this paper arose from the study of the wear of conveyor bolling: 
Tneidenta which damage belting are of various degrees of severity and different: types 9 
incident occur with different frequencies. From a given distribution of the life of belting: 
information is required about the relative importance of different types of incident. р 
particular, it is desirable to know whether the wear caused by nearly continuous abrasion 
more or less important than the damage due to the more destructive but less frequent blow? 
The model, which has been used, assumes that damage accumulates until the total reaches 
& certain level, when the belt is considered to be worn out completely. The damage done by 
each incident may be considered as a step of a random walk process in which the steps occur 


aba constant mean rate, and the problem is to relate the distribution of the size of steps t° 
the distribution of the time taken to reach a barrier. 


2. 'TIME TO REACH A FIXED BARRIER 
Consider the one-dimensi 


onal random walk process in which thesteps are positive and occur 
randomly at mean rate 


т, but the size of each step, v, is not specified. Suppose that the 81268 
of the steps are independent and the probability density of the step size is f(x). Assume ibo 
the process starts from the point и = Oattimet = Oandlet 4(4,4) denote the probability that 
the distance travelled at time t is less than or equal to и. Then 


e (mt) 
dest) m em (т), ш, о 
where р,(ш) is the probability that the distance cover db : 160% 
Therefore, if p(t) is the ed by r steps is less than or equa 


probability density of the time to r 1 і і h from 
the initial position, аа айыыга 


Ж а m ю ty 
pit) == qu —q(h, 8} = те D (т) {p,(h) —Pria(h)} e 


* Formerly at the National Coal Board. 


А. Mercer AND С. S. SMITH 31 


and the kth moment of the time about time zero is 


k = (r+k—1)! 
sanr” pilh). (3) 


But p,(u) = [ D, (а) (их) da and рди) = 1 for u > 0, so that if the Laplace transform 


Jo 
with respect to a of f(x) is denoted by F(s) and the Laplace transform with respect to u of 


P,(u) is denoted by P(s), then 


Pls) = Pals) FO) = = AUG = (РОУ. (4) 
Therefore the Laplace transform with respect to hof the kth moment of the time about нив 
zero is 
roi % hal D gre» - qut = Fisyy-k. - 
Ж ш fa t ipt 
Hy [> fx) dar in ішінен ees nn az, trn | 
1 = Геза = adi. " 
Pi) - £s fs) as = E: а i T 
tor 3$: sos Hs compet te bat the test three 
ПО that frain (5) and ( ns TN E cmm yg fof the geat ше 
momanta SF the Hig about Umg ПТ? 
1 1 ау 
—t irn | 
т аты; FAT 
Bold s a =) ы | 
811 За ШЫҒЫ RC MET Low). 
эз [ра ais (a 2af) s? 802 aj 4а/8 


der, 1946) that if G(s) is the Laplace transform of g(y) 
A -k where Ё is non-negative, then under certain 


in itis well known (for example, Wid 
ations (7) suggest that as h oothe 


With res G ) es As 
pect to y and as s > +0, (s 

conditions, gy) uS атыу y^ as y — 00: iei 
an, variance and third moment about the me 


1 ШЕ 


Ет m а 241 (8) 
ШЕНІНЕН!) 
c? = ABl ad да} За А 
o a), lid set за 001). 
1 [[Заз 93) h+ s aj 401 
Haa o Ед. he ith renewal of а process and then 
Alternati be considered as ле be obtained using results 
ati natively, the (i+ 1)th step mAy sions for the moments can be o 
pres 


i т цв proof of the asymptotic ex 
?llewa] theory (Smith, 1959). 
3. EXAMPLES 


the precedin 
negative expone 


results. е р 
Me following three examples illustrate ntial distribution, (т) = 06 ёа, 
98 за 


т, баррове that the size of step h^ 8 
en е go ers = Bag 
кв) = |, 
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e uy . 
Also p,(w) can be obtained directly as e-/" У um and therefore from (2), 
а=т - 
& (mt) (Bh) (9) 
p(t) = dune! E T 


; a reach 
Hence the mean, variance and third moment about the mean of the times taken to rea 
the barrier are 1 
ий = — (1+ fih), 


m 


1 (10) 
2= 2%), 
С mat Ah) 


2 
Из = та +3107). 


But a, = 
exact. 
(ii) Suppose that the size of ste 


identically distributed according 
Then 


ü ! іс res are 
| Ва" e-f* ах = F and substituting in (8) we see that the asymptotic results 
0 


Ў ; hich are 
p is the sum of two independent variables which 
to the negative exponential distribution. 


= B \* 
f(x) = f*ze-/* and F(s) = GS 4 
Also p,(u) can be obtained directly as 


e-fu у (Ви)! 


q=2r 4! 
and therefore from (2) . | . " ii 
pit) = кезеді ^ мү (5 er] (1 
so that аз h > со, H= = E E Т) Я 
о? = из (T6 Кг Ивет), (12) 
Из = 2 lis + m +0(f2h2 e) : 


-fer 
But a, = (r--1)/* and on substituting in (8) we see that the asymptotic results only diff 
from the exact results by terms containing negative exponentials, 


ftt 1 i a 
(iii) Suppose all the steps are of equal size £. Then 2mt has the x?-distribution with ^^ 
degrees of freedom where n is an integer satisfying n > 


МЕ>т-і and и = n/m, o? = ^ $e 
and из = 2n/m?. On substituting a, — £ in (8), we see that the asymptotic values of the 
three moments are 


(2), >» 
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4. TIME TO REACH A MOVING BARRIER 


Now suppose that the barrier is moving, relative to the random walk process, аба fixed rate A 
pex unit time towards the origin. Then at time f, the barrier is at a distance (A — At) from the 
origin and the process is certain to have terminated by time h/A. In the application to the life 
of conveyor belting, the relative movement is due to continuous wear of the belting, which 
occurs in addition to the random blows. This wear is the limiting case of that in which very 


small blows occur frequently. Hence, suppose that the steps consist of those whose size has 


probability density f(x) and occur at mean rate m and steps of size a, occurring at mean rate 
ті. Then if f,(2) is the density function of the step size distribution 


(m+ my) Л(а) = mf (a) +m 6(#—=1).* (14) 
Therefore (m 4- My) y, = ma, + m1. (15) 
Where Vr = | а? f, (ar) ах 
0 
and а, = | a f(x) dx 
0 
ав previously. Now let m, > oo and 2, ғ 0 so that узу > А. Then from (18), 
(тт) У > na, (1 +) (16) 
and (m 4 m4) y, > та, fo r22, (17) 
Where j= za, (18) 


ariance and third moment about the mean of the 
rier when the step-size distribution has density 
. Although it has not been proved 
are then used, the asymptotic 
e mean when the barrier and 


ee amy mptotic values of the mean, Y 
fu ribution of the time to reach the bar! 
ri netion (а) are given by replacing 4, by y, in (8) 

Borously, these results suggest that if (16), (17) and (18) 
values of the mean, variance and third moment about th 


r 
andom walk process converge at à constant rate are, 


в= (тё) А ән) 
т\1+р/ (aV р Фа |)? 


oz lí p Mash (Y а за (140), al 
ni s) Е ( p + Gai 3 \ P сақ 
lla 5ааз (1+2) , 944 1+р 
^" ss (res) (4 LP oet M all т, T р] | 
m? \1+р a at\ Pp р 3a$ 5 4 
5. PRACTICAL APPLICATION 


h a barrier has been found. 
been obtained as functions of the mean 
e distribution, the distance of the barrier 


and random walk process converge. In 
d the problem is to make some 


ap the previous sections the distribution of the time to reac 

не sions for the moments of that time have 

bon оу of the steps, the moments of the step-siz 
™ the origin and the rate at which the barrier 

рассо, the moments of the life distribution are known an 

erence about the discrete blows and continuous wear. 

of ô(h) see footnote to p- 68. Ep.]. 
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Ifthe variation in the size of the blows is negligible, then from (19) the asymptotic expres- 
sions for the first three moments are 


EE 
ЕСЕ 


2 
3 
2 1 Р Т od PL | z- (22) Г C25). 
^ в (195) ( ard E 2 Ру ANP 
where £ is the size of the discrete blows. Hence p = т&/А is the relative importance of =. 
discrete blows to that of the continuous wear. This is the parameter in which one is € 
interested in order to decide whether the surface of the conveyor belting, which is large^Y 


: i rent 
affected by the continuous abrasion, or the carcase, which has to stand heavy € im 
blows, is in greater need of improvement. The exact expressions for the moments ar 


8) E 


Table 1. Exact and asymptotic values of moments 


p тш т?с? т?з 
оно 

Asymptotic Exact Asymptotic Exact Asympto 
1-056 0-102, 0-102 — 0-004 - m 
1-625 0-370 0-370 0-075 ren 
2-229 0-901 0-938 0-620 ppm 
3-140 2-956 2-607 5-455 эке 
3-500 3-000 3-589 6-000 T 


complicated to be used for estimatin; 
numerically that convergence is rap 
be shown directly that the first thr 


Р wn 
5 the parameters of the distribution, but it can be yox 
id if p is not large. If М is an integer, n say, then it 
ее moments about the origin are exactly 


=z- 5 2 enl, 


4-1 т=0 r! 
" 1 n n—q4l т 1) 
m= (909+ 1) 2 X" a-g uus m 
т Qel ғ=0 т! 


A 1 4 n n—q42 r 
m= а оа) $ = пака Pea. 


The exact and asymptotic values of th: 
are given in Table 1 for МЕ 
oscillates about the exact 
denote 02/2 by 04 and yl 


n 
е mean, variance and third moment about the -— 
— 3. It will be seen that the asymptotic value for the varia? 


value. Now let p = (x—1), pm = z-?y and hjé = g'a ал 
3 


10 
[ U? by о. Then the three equations given by the asymp di 
expressions for the moments are 
у= ++», 
ay? = E Bye, (22) 
Gy? = o cda? (3 2); 
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Hence estimates of p, m and h/£ follow from the solution of 
43223 y1 — 32 (152, + 825) y? + 24(2-+ 452) y? — 600y — 29 = 0. (23) 

Inthe course of our investigation equations (22) and (23) were used to calculate a table of 
values of p, ит and МЕ for certain combinations of values of a, and о. We should be glad to 
make these data available to any reader of this paper who wishes to go further into the 
application of the theory. 

It would be possible to estimate the standard errors of the estimates of x, у and z and 
hence those of p, m and #/&. However, the formulae are too complicated for any assessment 
of the efficiency to be practical but, in any case, the only possible method of estimation 
&ppears to be that of moments. 


This paper is published by permission of the National Coal Board. 
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A TEST OF HOMOGENEITY FOR ORDERED ALTERNATIVES 


Ву D. J. BARTHOLOMEW 
University College of North Staffordshire 


1. INTRODUCTION 
Я > КЕР 42% ‘ TS er 
A problem which occurs frequently in statistical analysis is that of testing whether a numb 
ofnormal variates have the same mean. Given such v 


: ; р 
ariates with mean values my, nm», 
the null hypothesis which is tested may be written, 


Hym, = mM, =... = т. 


The object of this paper is to give a new solution to this problem which has many applic 
tions, especially in the social sciences. The appropriate test of the hypothesis can only Я 
determined when the alternative hypotheses, which are considered likely or important, hav Я 
been specified. The most general form of alternative, which is the basis of almost all existing 
work, may be written, Нить: + ть =... + My 

Аз examples we note that the standard analysis of variance tests fall into this category 
The X?-test for the equality of proportions in a 2 x k © 
the numbers involved are large enough. In some applications it is possible to be mor 
precise in the specification of the alternative. When this is the case it is advantageous 1 
make use of this extra information to obtain more powerful tests. This paper deals Wt 
what have been called ‘ordered’ alternatives for which the rank order of the means is know?" 
Thus instead of H, above, we shall consider 


м Е > — le? 
contingency table is a further exam] 


Н:ту > ть >... > т. 


+ . . in 
At least one of the inequality signs must be a strict inequality so that 77, is not include? 
H,.'The practical importance of this alternative will be clear from the discussion below- 


. . H B % 20 
the simple case k — 2 the situation is met by using a single-tail test, but for k > 2 м 
distinction between опе- and two-tail tests is 


ы ке 
> lost. The present work may therefore be 100 
upon as a generalization of the single-tail test. 


Attempts to meet the need foratest a 


^ г (1954) discussed the one-way analysis of variance and gave a distribution” 
est w. i 


Eas s tor 
living conditions’, etc. Such fa€^ 4 


D. J. BARTHOLOMEW 37 


A uL MM : кк and gave tests which amount to converting the problem 
Neither of Tm | n, 9 5 ocating scores to the experimental groups. 
Since each was d feet n " ie à mentioned above provides a general solution, 
the likelihood ч a ae а pu арра Те read tests to be derived are based on 
his ating : atio and are applicable to any problem which can be formulated as a test for 
"dicen y : normally distributed means. They are closely related to the familiar y? and F 
lated cen н AEN used when the alternative is H,. Significance is easily tested using tabu- 
ions or the special tables of percentage points included in the paper. 


2. ТНЕ TEST CRITERIA 

e independent observations 24,39, ..., Xz, Where x; is normally 

а viation c; (i = 1,2,..., k). It is important to be 

> ат on the meaning of the suffices; thus т is the observation from the population with the 

Bii mean under H, but is not necessarily the largest observation. In this section it will be 
3T " . 1 

" imed that the o’s are known, but in $2-2 we shall include the case when c; = cn; 

= 5 ч . H * 

м 1, 2, ..., k), the n’s being known sample numbers and c being estimated from the data. 

ur problem is to construct a test criterion which is particularly sensitive to the alterna- 

e is Н,, can be derived by the likelihood 


ti in 
ber l. Since the x?-test, used when the alternativ 
Tatio principle it seems reasonable to use the same approach when it is Н. Although it 


à H В E 
5 not clear, іп this case, whether the test derived has optimum properties, the method has 


è strong intuitive appeal and leads to & meaningful test. | 
The standard method of deriving likelihood ratio tests leads to the following form of test 


Criterion : E 
ajo, - E Хи, (1) 


2] < 

di js 1. Suppose that we hav 
istri : 

ributed with mean m; and standard de 


- 


"Sus St 
k a 
1 


.., hy, are the values of 201, ms, ..., my which 


1 ^ xc o* 
Qaximizo (1) subject to the condition Mı > [PP The symbol X; is used because of 
е close link with the usual y?-test which will become apparent in what follows. Тһе effect 
J. a . 
of restricting the Арыта to H, is to introduce the second term on the right. The values 
9 ете ptc i inimize 
Mı, Mg, ..., m, Which maximize (1) are those which min. 


k k 


wher A A 
re а, = _ & 
e a; = 1[0?, 2 = Yat; > а; and fis, Ma 


В 2 
U,- Xa; m) , 
ж 
has been reduced to the following. Minimize U, with 


triction my 2 ™2 2... ® т. 


inimum of U,,if we use geometrical terminology. 
bsolute minimum of 07, lies in the region H, 
ted and unrestricted problems have the same 
д for i— 1,2,...,k. This solution lies in 


8 
repr Problem of determining 2% | 
Tt wj € My, Ws, ..., Mks subject to the res 
e Vill simplify the determination ofthe m 
first step is to discover whether the & 


(m, > г 
ET H . іс 
12 Mm, >... > my). If this 1 so the restri к 
bviously "i = 
» if | 
If ni >2а>..,>2,. nd 2; < 2544 for some J. Thesthadlute 
inir is is not the case then i To avoid this we must introduce the 
ч A 
ев ш would require f; < Mj+1 
eti э 5.786 . 
lon m, > туу before minimi 
U- à aj(v; — Mi 
теі 


= R48, 88У: 


$ will be possible to fir 
which is outside В». 


; ]lows- 
zing Up 88 fo 


" -— 2 
а ту) + ара( на ты) 
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Тһе minimum of E is unaffected by the restriction and is therefore zero when @ Na 
Consider now the minimum of S. The point т; = 25, тур = т; falls outside the € 
т; > mj in the (mj, т.) plane. The minimum of S will therefore occur on the boun 

of the region, that is when m; = m,,,. In this case it is easily shown that 


A. =й. - tuta 
per nce аы 
27 G+1 


It will be convenient, for what follows, to introduce the notation 


© 5. 
X(p,g) = D az; У а. 


i-p i-p 


The minimum of U, subject to the restriction т; > туу may therefore be written 


^ 


т; = уу = X(j,j+ 1), 
ыы Ил 


The problem has now been reduced to one involving (k— 1) variates. If 


45252..2252Xj-1) »z42. 295, 


. че t 
the final solution has been reached. Otherwise the problem is the same as the original bu 


with the number of variates reduced by one. It may therefore be treated in the same way PY 
taking X(j,j+1) as a single observation with weight (а)--а;1). The rule for obtain? 
My, Mo, ..., йі, may, therefore, be stated as follows. Jf 
Arrange the observations in the order predicted by the alternative, thus, 2, 25, .. Vi к 
any consecutive pair ә), ауу are not in the expected order (i.e. Ға), > 2), form the avers 
X(j,5+1) = (aja; + 4121) (9 +53). 
The series is now reduced to (k— 1) members of which (К — 2) are unchanged and one i$ Е. 
average of two original ones, Proceed exactly as before, treating X( j,j--1) as a sing d 
observation but with weight (a;--a;,.). Continue in this manner until the resulting value? 


ате in the order predicted by H,. The following example illustrates the method. 
Example Ti Y а А 


60 —73 —141 7 
а; = d, = а = а. 
We notice that 74 > 9, 80 we form their average, 


X(3,4) = 4(— 1414-7) = —67. 
Now X(3, 4) > 25 во we calculate 


32% (3,4) +23) = X(2,4) = 10.73.14 +7) = — 69. 
As z > X(2, 4) the solution is 


fh, = 60, fh, = fh, = fh, = — 69. 
By calculating йі, May ... fh i i ie 
Р е е ›%›..., Ту, in the way described 72 can i tituti? 
in (1). It is easily shown, for estimates of the form ieee by E 2.1 Er 
i= Xan-z*- Late, ae ba ce % 
: i т IND — Ma)? = x Qn, — ay, 
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| The difference between the method of calculation of the usual x? and X? is that for the 
atter we use the estimates #1, Ma ..., My instead of the original observations 25, tə, ..., xy. 


This is illustrated by the following example. 


Example i 1 2 3 4 
az; 0:83 188 020 044 
a 5 10 — 10 5 


i 
Carrying out the procedure described above we find 


йү = = 098, з = йы = 0-28, 2- 0-63. 


Hence F? = 5(0-98)? 4- 10(0-98)* + 10(0-28)? + 5(0-28)? — 30(0-63)? = 3-68. 
alternative method of calculation. We could look upon the 
ns, X(1,2) = 0:98, X(3,4) 2 0:28 each with weight 
he usual x? sum of squares using X(1, 2), X(3, 4) and 
thod is always available and is, in fact, the 
rocess is complete we have / means, 


The example also indicates an 
Problem as involving two observatio 
A +10). 3$ would then be calculated as t 

he appropriate weights. This alternative method 
Simplest method of calculation. When the averaging р 
ха, ky), XT 1, kı + ko), es XU k+ 1, k) 
with weights A(1, kı), A(k, +1, ky + ka), s A(k— E +1, E), 
d 1 
Where Alp) = Ха Хи 
? 
Xi is then obtained as the sum of squares of these means. This will be referred to as the 
Teduced form of 
t blem. А 

2-2. In this d ies the modifications necessary when c can be estimated from the 
data, ; “к arises in the one-way analysis of variance by groups where 
Consider the problem as ee, which are normally and indepen- 


We have ob i с то A 
servations жы ( = 1,2...) = > s 
dently distsibuted мы үң mean m, with common standard deviation с. Define 


k k 
m. oes 1 Уау and #.= Enn Zn. 
n 
Nij=1 j 
then уу zm д come from populations with the same mean value. 
eh °F, „Жа oe в. 

e рео т, rne it pode im ejns and therefore a; = nj|c?. ае obvious 
Modification of Ёё test oriterioń ist ent of c? ees atu | 
» "tained from the analysis of variance 

€come 


he replacem 
table. If this is denote 


Ў пит z,e 
1 
: follows. Suppose, for example, that 
Howe ishti $ can be obtained a5 -— 
Б ver, a slight improvemen e 2 test criterion we form 
d 22: ting the te 
1. < Di. then following the rule for calcula g 


XG,i+ = 


(в: Misa) 
)th groups and then calculating the 
e calculated from this new 


(n, ate. 
Thi, ; s ith and (8+1 
818 exactly equivalent to pooling қа E residual mean squar 


"Vera, к d. Th 

ge of the single group so forme hät 

table will have n m degree of pae ei m bya 
tween groups’ degrees of freedom have bee 
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constant. In general, it can be shown that if the reduced form of the problem consists of 


1 groups then the increase in the residual degrees of freedom will be (2-1). The test criterion 
to be used is therefore defined as 


ESSA deni wa (3) 
В, = Xn(R-z Pl) 82, 
1 
а Я : T һе 
where S? is the residual mean square of the analysis of variance table obtained from t 


re, Тоқ s the 
reduced form of the problem, having (N — 1) degrees of freedom (^ = Мп ). Р, is thus th 


1 
Р trating 
ordinary mean square ratio calculated for the reduced problem. An example, illustrating 


EX. » oriterion 
the method of calculation, is included in $5. It can readily be shown that F, is the criterio 
produced by the likelihood ratio principle. 


З. THE DISTRIBUTION OF X? AND F, 

3:1. The distribution of xi. 

we consider first the case /; 
have 


Те ; ^ n 
Toillustrate the method of approach to the distribution pr me s 
= 2. This must, of course, reduce to the familiar one-tail test- 


X = а(х, 2): for $2 Ly 
= for z,2 3X. 
For tests of significance the probability required is 


2 

Р > ур= рг [n > 13; X a;(x;— T)? > » 
1 

| 


when the null hypothesis is true. Now the value of 


2 er 
Уаз, — =)? does not depend on the ord 
1 
of x, and a, 


but only on their absolute difference so we may write 


PHIL? = Prle > ары (бааар э). 

Further, since on the null hypothesis Pr (y, > та) = 4 whatever а, а», we have finally 

РО > y) = Ру > у), 

baie тан X? with v degrees of freedom. This will be recognized as the equivalent С 
For larger k the method is similar. Let Pil, k; а, а, 


m the 
5-48) denote the probability that U^ 
reduced form of the problem consists of 1 means when the д тағаны тұ 
had weights fy; 


riginal observations Wy, Uae c 
С 7505. When there is no risk of i 
write Р(4,5). binis 


9n we shall suppress the 4/8 а 
Тһе probability integra] of Xi may thus be Written 


k 
Pr fs? => ss 
"(XE y) PEU k) Pr {x2 > у p 
Prot = 0) = Р1,4), 


probability that 
ual to Pr {Ж + m 


(B 


where Pr (xt > | l} is the ХЕ exceeds 
means. This is simply eq у 


" w 
given that it was calculated 470 
У} ав we now 


show, 
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An obvious simplification of the notation introduced at the end of $2-1 gives 


1 i 
= 1 1 
where X-XA,XJEA, and var(X)- Af! (4-1,2,...,)). 
1 1 


Two conditions must be satisfied if yz, for given l, is to be distributed like 4? with (1—1) 
degrees of freedom. 

(i) Each X; (i = 1, 2, ...,/) must be normally distributed about a common mean. 

(ii) The restriction X, > X4 >... > X, must be irrelevant as far as the distribution 
of Xt is concerned. This amounts to showing that the X's may be treated as if they were 


independent. 

The first condition is satisfied, 
апу X; does not depend on the magnitude of the v 
therefore be regarded as being selected at random. | | А 

To show that the second condition is also satisfied, consider the following, algebraically 
equi Р т? 

quivalent, expression for ў- 1 
cá 7 —S2M AY ada: 
= DDAA i) PE 


i>j 


on the null hypothesis, since the averaging which leads to 
alues which are averaged. They can 


If the Х*в were independent then (X; — Xj would be the square of a normal variate with 
Zero mean and X would be a linear function of such variates. When the X's are ordered, 
Ak . 


(X,—X,) is distributed like the modulus of a normal variable (since Xi exi when % a 
Owever, the square of the modulus has exactly the same distribution as the square ot the 


, neces те т 
difference when the sign is not restricted. The distribution of Е. 13 ныне by 
і erie й iction X, > - Я 
the dependence among the Х78 resulting from the restriction X; > Аз? ... 2 Ar 
о 


(4) may therefore be re-written 


ы А.Т” уйуй. 
рыбе o y) = РФ |, sitis * 


Dp (72 = 0 = PLE) m 
r=} the determination of the probabilities Р E). In 


= 3 and k= 4 and a partial result for k = 5, 


The distribution problem thus centres ой 
ariate integral. The following notation is used 


What follows we give complete results for Ё 
Using known results about the norma 


Zi = 0—11 


] multiv: 
(i252. (k— 1)). 

Е .. which is found to be 
Puis the product moment correlation coefficient of z; and zj, whl 


_ щы — for die но Е 
Diii = - аавын | 


=] for %=} 
ра | 
ы 0 for [i-i] > 1 (6) 
ke 3 P(3,3) = Pr {z > 0,29 > 0). | | 
On tk a normal bivariate distribution en zero means and 
he null hypothesis 2; and 22 have known result of Sheppar 


ing the well- 


1 SEPA. 
рв.) = 1[1- 7 


Cor å 
Telation coefficient рг; hence US 
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In the special case ау = а, = ар = — } and P(3, 3) = 1/3!; a result which could have been 
obtained directly since, in this case, all permutations of the z's are equally likely. P(2, 3) can 


be obtained using the known results for  — 2. Two possibilities must be considered: 

(i) £i < £a X(1,2) >a, 

(i) 2 «2, X(2,3) < т. 
Now X(1, 2) is normally distributed with zero mean and variance (а, +а,)-1, so that. the 
event x, < 2, is independent of the event X(1, 2) > z,. Hence the first of these possibilities 
d and P(,2; ара) P2, 2; (а, +a), ag). 
Ву а similar argument the probability of the second is 


P(1,2; a4, аз) P(2, 2; ау, (а + аз)). 
Because P(1,2) = Р(2,2) = 1 whatever the weights, 
P(23)-1-1-1. 


3 
Using the fact that X P(Lk)-1,P(, 3) is found to be 
1 


1 -1 
25 08 ps. 
k=4 P(4, 4) = Pr {z > 0,2, > 0,2, > 0). 

When z, Zə, 23 are distributed in the trivariate normal form with zero means, David (195 
gives the result 


P(4,4) = z (2л — с08-1 p,, — сов-1 Роз — €0871 руз) 

the p,,’s being given by (6) 

P(3, 4) is built u 
and 3 as follows: 


P(3,4) 


29 
р, by a similar argument to that used for Ё = 3, using the results for k 
= P(1,2; a, аҙ) P(3, 3; (a, +4), dg, а) +P(1, 2; ag, аз) P(3, 3; a, (as + аз), а) ». 
+P(1, 2; dg, а.) P(3, 3; ал, as, (05 + ® 
Therefore P(3,4) = 1 [Е _ 6081 ға) 4 ( д засл) 4 ( EM za] 
4 т п т j 


H n 2. б 
Тһе correlations, which are given below, prove to be equal to the partial correlations ребе 
the z's, as indicated. 


pes (а, + а.) ал _ 
(@,-+@-+-а5) (a, + а) | база 


a=- / ait а 

(а, + aa + а) (а + аа.) Різа 
"T J а.(аҙ--а,) 

( 


y+ Ay) (Ag+ а. + а) тәме 
whence P(3,4) — $— (e0s p. + соз-1 D23.4 + соз—1 
equation for P(2, 4) is found to be 


P(2,4) - P(1,3; 0; а», аз) P(2, 9; (а-а, +a), a4) + P(1, 3; dy, 


+P(1, 2; 99%) P(1, 2; ag, 


D313.9)/ 47. 
In the same manner the т 


а) 
аз, 4) P(2, 2; dy, (аъ t03 + 5 
а) P(2, 2; (a, - a), (+ 
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Substituting known results in this equation we find 
P(2, 4) = 1+ [cos раз + OS™ pss]/ 47 
Finally, by subtraction SMS 
Еъ4 P(1, 4) = (соз! ру + COS™ P231 + cos раз.) / 4r — $. 
Maie i ir e possible to obtain the distribution fork > 4 because there is по known 
r(2; > 0,25 202 > 0} when m > 3. Some numerical values and methods 


Plackett (1954) and these may prove useful. For the special, 


have I 

; however, been given by 
e: о y 

ults can be obtained for k = 5. In this case 


uti 
all Important, case when the a’s are all equal, res 
Permutations of the a^s are equally probable and so 
Pr (x, > а> tg > Ñy > Ts) = P(5, 5) = 1/5! = ті» 
for < 5 are all known so the above methods can 
ble 1. I have conjectured the following result 
146 the correct values for k « 5. 


Th РРА 
ч тена ав required to obtain Р( 5) 
for ди The results obtained are given in Та 
1e general case with equal weights*. It yie 


k-1 p 
Рак) = X Pü,k-jPü- ЕШ 
1=-—1 
The proof is straightforward for | = 2 andl = №, but breaks down for the intermediate cases. 


Table 1. Values of P(l, k) when the weights are equal 


120P(, 5) 


2P(, 2) 6Р(1, 3) 


24 
50 
35 
10 

l 


given that there are | means 
by the argument of § 3-1. The 
which are independent of 


121 distribution of Fy 


=. Ў т mo 
in i * The distribution of F,. The conditio: 
Proba >. duced form of the problem, is simply th 
ол abilities Р(1, Е) depend only on the correlatio 


^ Theref, А 
E УР, k) Pr {fan 2 ^ (7) 
2 


at of Fi, NA 
n coefficients Pij 


Pr Gu > у} = 


Pr(F, = 0} = P, E). 


o transform the F distribution to an incomplete 


F 
BA testing significance it is convenient t 
ction giving ОДРЕ 
= DP 2 — 1), 2(1— 1)), (8 
Pr, > y) = È Pa p 0-0- 0—0) (ЫЕ (8) 
TAS Í у—1+@—1)7). 
| a pu S of on p. 48- 


* See Note added in pro 


44 A test of homogeneity for ordered alternatives 


4. CALCULATION OF PERCENTAGE POINTS 4 
Тһе results of $3 give all that is necessary for the calculation of percentage points for € 
and F, statistics. Given the weights a,, a5, ..., a, and tables of the X? integral or incomp a 
B-fanction, it is simply a matter of substitution in (5) or (8). The purpose of this section is 
give tables of percentage points to reduce the work involved in testing significance. 


Consider first the distribution of X3. From (5) we have 


TE (oy see} [na 
Pr {x3 > yl =5 ( а-ы | PANS) UNS + 5 px) ам. 
Jy “Jy 


Table 2 gives the values of y satisfying 
Рг > у} = 0-05 


= 0-01, for —p, = 0(0-1) 1-0, 
This table will be adequate for most practic 
When high accuracy is required it is better 
rather than use interpolation methods. This 


Wm ту. 
al work, using rough interpolation if ce 
to make a direct calculation of the proba Е 
remark applies also to Table 3. 


Table 2. The 5 and 1% points of Xi 


= різ 5% 1% = ріг 59% 1% 

0:0 4:931 | 7:289 0:6 3-715 6:700 
0-1 4-158 7-208 0-7 3-593 6:556 
0-2 4-081 7-122 0:8 3-446 6-377 
0-3 4001 7-030 0-9 3:245 6-130 
0-4 3-914 6-932 10 2.706 5:413 
0-5 3-820 6-822 — =- — 


Table 3 gives the 5 and 


the 
requirement 


1% points of Xi for —p,,, —fss = 0(0-1) 1-0, subject to 

—4 — p3) < Раз < A0 — pl). 1 
ты TOME ;o jn P 
This restriction is simply a consequence of the fact that partial correlations must lie 11 of 
he probability inte 


I 
i буз» Pog the lower half 0? y 
· Values in the upper half are i : 


8 
n calculated, Although Де fof 
= : т env e 
k = 5. Itis planned to 4 and one with 4-way pl 
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Table 3. The 5% (upper figure) and 1% (lower figure) points of xi 


| 
0-0 0-1 0-2 0-3 "E T " 

0-0 5-435 | ж gh 
8746 | | = 

SR 5:372 | 5305 | EX = 
8-676 8-600 

0-2 5-307 5.235 5-160 ЖУ Е 
8-602 8:521 8-436 = 

02 5-239 5162 | 5081 | 4-993 z A = 
8:595 8:438 8:346 8:247 

04 5-167 5-083 4-994 4-808 4-791 Z = Ж 
8-449 8-348 8:247 8:137 8:015 = ni 29 

Hi 5-088 4-997 4:898 4-791 4-670 4-528 E 22 
8:351 8-248 8:137 8:014 7-873 7-709 == БЕ 

ы 5-003 4-900 4.189 4-065 4-524 4-354 4-135 Ж. 
8-250 8:135 8-008 7-867 7:703 7-508 7:244 = 

0-7 4-904 4-787 4:657 4:510 4-337 4117 3-805 3:137 
8:135 8-002 71854 7:684 7:482 7-224 6:846 6-001 

OR 4-787 4-644 4:484 4394 | 4056 | 3715 | 2-706 = 
7:994 7-831 7:647 7:427 7-147 6:734 5:413 — 

0-9 4-631 4-432 4-196 3-883 3-353 = - = 
7-804 7-579 7-303 6:934 6-278 

10 4-931 = = == = = = - 
7-989 e =~ = — 

le a 


Table 4. The 5 and 1% points of Xi for equal weights 


ON OF THE TESTS 
In thi ich the 3? or Г, tests can be applied. The first 
8 secti ata are given to which the Xk { ; d 
Wo Telate i P са а к à and are taken from the paper by uev (1955); the 
thir dia 2 x і continge y ‘anoo table. The justification for applying t e test to2xk 
n artificial analysis of vari bservations in the ith category 


2 f n; 0 
i hgenoy tables is as follows. Барро" Laem to test whether these proportions 
Portion tain characteristic; 
р; Show a certain 


bow that p; is normall 
ее А „ыу бой #0007 L wein assume + mid 
ы If each n, is large and p; p P is the proportion showing the characteristic 
: mine with aped P( E Ve p an inaform suitable for the application of the y7-test 

ole sample. The proble 


5. APPLICATI 
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2 2. 
ith а; = n,/{P(1—P)} ( = 1, 2, ..., k). This approximation is, of course, the basis of the p 
han and is usually considered to be adequate if no expected cell frequency is less than, say; 
Example 1 


» original 
Тһе data shown іп Table 5 are given in Armitage's paper, where references to the Mert 
work may be found. The null hypothesis is that the chance of being a carrier does not dep 


Table 5. Relationship between nasal carrier rate for streptococcus pyogenes 
and, size of tonsils among 1398 children aged 0-15 years 


‘a 1 
Е d tonsils 
Present but wlexged veda 
not enlarged Total 
т ++ +++ 
d 
Carriers 19 29 24 72 
Non-carriers 497 560 269 1326 
Total 516 589 293 1398 
Carrier rate 0-0368 0-0492 0-0819 == 
Table 6. Changes in the size of ulcer crater, 3 months after treatment, 
for patients in two treatment groups (A and B) 
Number of cases with crater 
Treatment Total 
group 
Larger < $ healed > $ healed Healed m 
[= 
А 6 4 10 12 92 
B п 8 8 5 32 
m 17 12 18 17 ee 
Proportion in 0-353 0-333 0-556 0-706 m 
group A 
L —L 


t 
: Е : th? 
on size of tonsils; the alternative that the chance increases with size of tonsils. We not?” o 
the proportions are in the order 


expected under the alternative and hence 2 has the е 
value as x? for this table. 
Thus а = 7-863, | 516 x 293 
Pa = / (516+ 589) (5894503 | = 0394. 


Referring to Table 2 this is foun 
customary y*, for which the signifi 
Example 2 4 

: ‚ В t 
‘atone 5 paper, gives the results of an experiment to compat? 4 
| We test the null hypothesis that the two treatments are equally good, against the айе й 
tive that treatment A is better than treatment В. Under this V crm should © 


. . : 
d to be significant at the 1% point in contrast 3 
cance lies between 2$ and 1%, 


0 
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t : . 
= proportions to increase as we move from left to right across the table. In this example the 
A Oportions are not in the expected order to begin with. Because the first is greater than the 
oie the first two groups are combined into one giving a table with three groups. Thisnew 
е has the proportions in the expected order. X; is obtained by calculating д2 for this 
pooled table, giving _, 
> XE = 5:900, —pı = 0593, —р = 0441. 
Extracting the relevant part of Table 3, remembering that рі; and рь; can be interchanged, 
We have for the 5 % points, 


— 
0-5 0-6 
04 4.670 4-524 
4-528 4-354 


Xi is therefore clearly significant at the 5 % level (but not at 1 96). If the y*-test had been 
Used the Significance would have been between 11 and 12 %. 
Example 3 


_ The following table gives artificial data constructed to 
6086 to an analysis of variance table. 


demonstrate the application of the 


Table 7. Artificial data to illustrate the application of the F, test 


нзр Total 
Total an 
No. observed 12,378 
Sum of squares D 
Mean 

| vec 
29-50 
30-17 


30:31 


at the means will increase as we move from 


Tt is gy : dicts th 
tive predicts 
left to на я by averaging the means of groups IV and V followed 


indi le. The reduced 
УП ТУ d V as indicated underneath the tab. 
› ТУ and V and finally IL, ПТ, IV an : pee 
us 9f the problem гоя ds consists of two groups; pipe ЧО о ат он groups 
5 With mean 30-31. Calculating the F ratio for the reduced p 
F = 3:008, | 
“Мов the value of F, for Table 7. То test the significance we require, from (8) 
3:008 — — 9.1432. 
2= 3.008 +18 
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The following table sets out the method of calculating Pr (F, > 3-008}. As the numbers in 
each group are equal the probabilities P(/, k) are obtained directly from Table 1. 

There is thus no reason to reject the null hypothesis that the groups are drawn from 
populations with the same mean. 


Table 8. Calculations required for testing the significance of Fy 


L440—1), AN —1) 
1 РІ, k) 
z= 0:14 | гё = dB 
| | 
2 50/120 0-1041 0-0916 
3 35/120 0-2775 0-2512 
4 10/120 0-4779 0-4439 
5 1/120 0-6614 0-6281 


Pr (z > 0:14) = 0-1696, Pr(z > 0-15) = 0:1537. 


hie 6. FURTHER WORK 
As pointed out at the end of $4 the greatest need is for tables and approximate methods #0 
simplify the method for testing the significance of Й.. It would also be an advantage led 
obtain values of P(I, k) for > 5. If the conjecture given in $3 above can be proved, then the 
distribution will be completely determined for the case of equal weights. Until this problem 
has been solved the best course appears to be the reduction of the number of groups 
pooling. This will undoubtedly result in a loss of efficiency but even so may be better thar 
the usual y?-test, which takes no account of the order. The answer to this question оГ 
require the calculation of the power function of X. А new problem arises in the analysis : 
data like that of example 2 in § 4. When applying the y?-test it was assumed that treatment 
would be the better of the two treatments. If we had wished to test for a difference witho" 
specifying which would be the better the alternative would have had to be, 


Ну: тү > т„ >... > ту, 
ог My S Mg <... < т. 


Tt is hoped to communicate the results of work on these topics later. 


I am indebted to the referee for suggesting a shorter method of deriving the test cri torio” 
and for pointing out a gap in the original derivation of equation (5). 
о added in proof. Тһе recurrence relation for the probabilities Р(1, k 

5 proved by Mr R. E. Miles. He has also established the more usefu 

РЬ) = [PQ.—1, k—1)  (k—1) РИБ 
for 2 =} < 5-1, which, together with P(1, k) = k and 2 2. 
easily. It is hoped to publish the proof in the 1 
for testing the significance of xz and F, 


P 
) conjecturod in Бор 
1 recurrence rela 
m 
mo 
р k) = (k1)71, determine the probabi 0 
next issue of Biometrika, This result can now P? 
for k > 5 if the weights are equal. 
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SOME PROBLEMS INVOLVING LINEAR HYPOTHESES 
IN MULTIVARIATE ANALYSIS 


By С. RADHAKRISHNA RAO 
Indian Statistical Institute, Calcutta 
]. INTRODUCTION 


We cons; 
ns — у : , 3 
sider p correlated normal variables jj, +++ Yp with the following specification. 


(1) 
ens E(y;) = 4a +... Е in Tas (1-1) 
mm Me $ а 9 
b atrix notation E(y) = Ат, where y 18 the column vector of the random variables 
жала Y s а А B 
“Уыт that of т unknown parameters 71. +++» Tm and A, the matrix of coefficients is 


known. 
ise wheree isa known constant and Ais unknown, 


(ii) Thedi А 
) Thedispersion matrixof y, -Yp 
art’s distribution based on f degrees of 


ut қ 
JN Mie ipee S of A is available and has Wish: í 
The set н ependently of 34. «66s Yp: Тһе matrix A is assumed to be non-singular. | | 
Benerali, -up is thus seen to be that of the theory of least squares with correlated variables 
iyu, to the case where the variances and covariances are unknown, but an estimate is 
Siege The estimates of paranieters as well as the test criteria suggested in this paper are 
unknow y those derived by the theory of least squares, substituting the estimate S for the 
эм, A. Fortunately, the distribution problems present no difficulty and valid in- 
бі а be drawn with the help of the existing t and F tables ү E 
Normal a gang wheresuch a set-up holds is when we have a sample of size n trom а p-variate 
istribution with an unknown dispersion matrix A and the mean of x; the ith 


Variable. ; 
le, is as specified in (1-1). Here 

E 1 1.2 

у= 75 ш) 

n — 1) degrees of freedom. 


variances onf = ( 
сәу, and S we wish to 


and $: 
is н А 
the matrix of estimated variances and co : 
alization of Yı: - 


n 
"nine fea. set-up, on the basis of a given T° 
i) Is " he following problems. | 
с арвошан!оп (151) adequate? Thus, Ну». 
P of individuals at p time points, we may ask whe 
en degree. In such à 


m represent the average weights of 
ther the average growth curve can be 
case the parameters repre- 


fr т, be obtained and the precision of 
Ж? «әйт 


ates 

Р ех i 

Xpressed ? кн general linear hypotheses concerning 
c 


m „шша the specification (1:1), how 
x. m$ Det 
\ | "(€ a class of linear fu 


о | - 
аы d W can simultaneous confidence limits to 


" 
netions of ту, ..., Tm be 
red earlier, such as testing 
f'subsets of variables, etc. 
tion for the present 
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lar cases conside 


uality of means 0 
is the main motiva 


Toblem fy: р 
the em (i) is a generalization of some particu 
bles, ed 


ual 2 
(че уж of means of p-correlated varia n 
, 738: Rao, 19460, 1952). Problem (iv), whe? 
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i se given 
i attempt to obtain confidence bands of a smaller width (see § 4) than ag me 
еа Ere in the case of linear regression such as polynomial growth curves, do: 
: | ы ы B 
at relationships, etc., which are covered by the general set-up (1-1). 


2. TEST FOR ADEQUACY OF THE SPECIFICATION (1:1) 


Let B be a matrix with r, equal to p minus the rank of А, independent vectors 


by by gens by 
»-[ deed. | 
% Зы = Шы " 
such that the vectors in B are orthogonal to those in A’. The computation of B, known а 


М t (Rao, 
deficiency matrix of А”, can be conveniently carried out by the method of sweep out ( 
1952, %1а:3). It is easy to see that the variables 23, ..., 2, defined by 


(2:1) 


и = бу. +... кн 


2, = by sob, 

are such that (2:3) 
i E(z;) = 0, G= 1,...,7) 

independently of the unknown 

Given у, .. 

e1 BSB’ 


parameters c, and with dispersion matrix equal to ғ. a X 
» 5 and S we can computez,, ..., 2, by (2-2) and their estimated d _ he 
= eG (say). Itiseasy to see that С itself has a Wishart distribution base 
same degrees of freedom fas that of S and is distributed independently of z4, ..., Žr 


; f 

s ms à that 9 

The problem of testing the adequacy of the specification (1-1) is the same аз тре 
testing whether the r correlated variables defined in 


(2:2) have zero mean values. 
appropriate statistic for this is Hotelling's T defined by 


E а (2% 
6, Lie a 6-а- рен 2,2; 
with the corresponding variance ratio (Rao, 1952, p.71) 
ы f+l-r (29 
— T 


(C istie Ё 
based оп r and ( f-- 1— т) degrees of freedom. The matrix B is not unique but the statis 
18 Invariant as we shall see below. 
Ап alternative way of deriving the test criterion i 
priate to test the specification (1-1) when A is know. 
theory is the residua] quadratic form 


(2 
min e(y — Ат) A-1(y — Ал) 


А ro 
8 to consider first the statistic a ев 
n. This according to the least 84 


n + ati? 
obtained by fittin east squares, This residual quad" «we 
form (2-6) for known A is distribute А 


(е? 
min e(y — Ат)! Sly Ат) 
obtained by substituting the estimate S for A in (2-6). 

Given the matrix B of (2-1) itis easy to see that 
order (p — ғ, ғ) such that BSF = 0, and with suc 


(у – Art)’ Sa(y Ал) = (Ву) (BSB’) 


| r)” 
there exists a matrix F of rank (p — 
h a choice of F (Rao, 1952, p. 53) 


" 
~ (By) + (Ey — БАт)(Е5Е”)-ЦЕу-ЕАт) | 
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which i iti à 
e 2 es sum of two positive definite quadratic forms with the first expression inde- 
nt of т. By choosing т such that Fy = FA x і 

inb аы. 0. g y т, the second expression becomes zero, 

" min (y — Ат) 5-Цу- Ax) = (By) (BSB’)- (By) = z Gaz. (2-9) 

san the statistic (2-7) is same as 7. defined in (2-4) apart from a constant multiplier. 
P entally the invariance of 7; with respect to the choice of B is also established. 

" e, thus, have two methods of computing the statistic 7} either to find a B and compute 
е quadratic form z/G-!z or obtain the minimum value in (2-6) directly by some other 


method. 
3. TESTS OF LINEAR HYPOTHESES 


I ы 5 + s 
п this section we consider the problem of testing linear hypotheses assuming the 


Specification (1-1) to be true. Let 
1171 +... + ИтТт = ё), 
А (3-1) 


sE 
Ug Tide UkmTm = Sk 
wn, be a set of linear hypotheses to be 


ori . а 
Їп matrix notation От = Ё, where О and Е are kno 
ose that there exist linear functions 


mataj, The rank of U is taken to be k. Let us supp 
THD Янь OF у, ..., Yp Such that 
E(z,.i) = 44171 ToU Tm: G =1, es E) (3:2) 

А linear function Uy Ty +... FU ‘tm is said to be not estimable ifit cannot be expressed as an 
expectation of a linear ——A ot the variables yi. p A linear hypothesis cannot be 
t estimable, since such a para- 


te H : ae 
ee when the parametric function associated with it is not est пе 
etric function can assume any arbitrary value without altering the probability of the 


Observations, In the case of a set of linear hypotheses such as (3-1) each of the associated 


linear functions must be estimable (Reo, 1952, p. 83), the condition for which is that there 
exists a matrix L of order (k, p) such that U — LA, where A is thë matrix of (1-1). | 
The hypothesis (3-1) together with the specification (1:1) 18 equivalent to the hypothesis 
E(2,41) = Е, ЖЕРЛЕУ? = & (3:3) 


Е(21) = 0, s B2) = 0, 


endent estima 


(3-4) 


tes of the variances and co- 
bution on the same degrees of 


given that 


У 
here 21,...,2, are as defined in (9-2). Indep 


Variance Wishart distri 
S of z,, ..., г, derivable from S have a Wis ; | : c 
Cedom, не рана of testing а hypothesis such аз del (3-4) was discussed in 
Parlier riate test is as 10 ows. 

papers (Rao, 19460, 1949). The approP terms of yr «++: Yp- The estimated 


et А ux i sexa icd , 
D bethetransformation matrix дш = E H (вау). ТЕН- = (A) is the reciprocal 
ды ations of all the (r+ k) variables is 


Vari 
та. Covariance matrix of 21, «o 5+8 gie, 
rix, then the statistic for testing give? values of exp 
[4 — E (2))]- 


e r+k d " 3 
qo = 7 | Ы [= E(%)] 9 
jt fned in (2-4). The statistic for 


(3-5) 


T ; а 
he Statistic 7! for testing the expectations (3-4) is already de 


esti 
ng the hypothesis (3-3) given (3:4) 18 
уча" Ты (3:6) 
EM wm 
4-2 


` ivariate lysis 
] n пеат theses in multivariate апай 
roblems involving linear hypo 
52 Some pro 


52. p. 83). 
o 5 ree Rao, 1952, p. 
i d — k — r) degrees of freedom ( i E | 
Е" 5 tio based оп k and (f+ 1 g а 
ud ma hose dis value of (3:6) would be the same for all choices of line и бі шіні 
һа: ы ; А | riae 

= imc ii that their expectations are as in (3-2). The optimum properties 
е 0. m di i lier paper (Rao, 1949). -— 

ype (3:6) is discussed in an earlier pay ; | n—— 
wt Abd method of computing 7", „, without necessarily making a transfo 

na Ls 
new variables, is provided by the following equation 


7) 3 
371) 
7 с "5-Цу-А ) ( | 
Т, = Min (у Ат) 5-Қу жу, 
€ 
where the minimization is subject to conditions (3-1) on т; whereas 
(3-3 
іт = min (y — Ат) S-(y — Ат), 
e 
the minimization being unrestricted. mamania y the 
A third method of computing 7, ог directly the difference Т,» =. 1їв puo разной 
theory of least squares (Rao, 19464). This makes use of the estimates of т a aaa 
minimizing, unconditionally, (y – Ат)” 5-Қу- Ат) for which the normal equations o9) 
A'S-ly = А5-А2 іе, Of 
'"q- if it exis 
where ĉ represents the estimates. Let C denote the reciprocal of A'S-!A, if it e 
а pseudo-inverse as defined by Rao (1955), and А "T 0) 
| dm uates uh. (1,0). | 
4 
(5 
UP fle) = (а-у (uev (a -& " 
«oy hand 8! 
Where U and are given in (3-1), Substituting for the numerator in (3-6) the right-ha! 
expression of (3:11) we find that 
43 | 
GF 1-k-r)e 4-5 (UCU')-1(a . gj (3 
Af + 
has a variance ratio distribution on kand ( fl —Ё— т) degrees of freedom. 


| 
4. SIMULTANEOUS CONFID 


ENCE LIMITS TO A cr, 
Let us first consider 


ASS ОЕ 
the problem of Setting up c 
metric function "9 


ns 
гомстто 
LINEAR PARAMETRIC а pa 
. e 
nfidence limits to a single estimab 


mu 


А =. da | 
Using the notation 


"ra 

(312) that if g — шуъ. Бит" 
D 

ЖЕЛІГЕ j 

| G1 + Ty 
has a t-distribut; | 
а distribution on (/-т) degrees of freedom, Hence the confid limits to А Е 
> Ontidence lim 


3) 
day Wane = а | 
= ifa em) 
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In particular, confidence limits to any single parameter 7, can be obtained by choosing 
A = т; provided it is estimable. The confidence limits to 7; are 


2+ tun ү 
611 — Я +. 
- m (1 +1) (24) 
where с;; is the ith diagonal element in the matrix C. 
Now we let the veetor u belong to the (< т) dimensional subspace spanned by the 


vectors 
S [Un ++ “іт 
Эс... (4-5) 
ША тек Ue, 
di— À; = (кат uu m) -A (= les). (4-6) 
Then it is known that (Као, 1946a, 1952, p. 85), 


and represent, as in (3-10), 


(u’t —u't)2 j s 
na = (d-2)' (UCU')-1 (d — 7 
i u'Cu ( i )?(d—-2A) (4-7) 
When u is restricted to the subspace (4-5). Using the distribution result (3-19) 
| (а? ы шт)” (7+1 Бе, UNT r) е 
ud ETT ^ fk m) (4-8) 
has an F-distribution with k and (f-- 1 — k— r) degrees of freedom. If F, represents the 
€-value of F corresponding to these degrees of freedom then the probability that 
(а4-шт” , MT) 
wou “АА PR) 
for all u in the subspace spanned by the vectors in (4.5) is not less than 1—«. The simul- 
aneous confidence limits for all u’t where u' belongs to U is thus provided by 


шт... uu T, JE aao). (4:10) 
The argument employed here is the same as that used by Tukey (1951), Scheffé (1 953), Roy & 
Se (1953), Hoel (1954) and others. It may be observed that the first attempt to obtain 
Such Simultaneous confidence intervals was due to Hotelling & Working (1929). The result 
+ es differs from that given by Hoel (1954) in that, in effect, he bases the confidence interval 
а e distribution of F on p, the number of variables y, and (f+1—p) degrees of freedom 
Gages of the dimensionality of the subspace of vectors u’ for which simultaneous 
ence intervals are intended. Unless the factor (1 +71) in (4:10) is very large, which may 
арреп when the specification (1-1) is not correct, in which case the procedure suggested is 


N . 
к ot valid, the confidence interval based on F, with k and (f+ 1 — 5 — ғ) degrees of freedom 
ould be smaller in length when k < р. 


5. AN ILLUSTRATIVE EXAMPLE 


erosut of the preceding sections show that the computations involved in tests of hypo- 
богы, = estimation of parametern are the same as in the method of least squares with 
miis variables, although the distributions are slightly different due to the estimated 
Ston matrix being used in the analysis instead of the true one. The computations can be 
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: res with correlated 
i ically as in the method of least squares wi 
a rs tme ot у ke this section self-contained for the use 
ariables. We may briefly review the methods to make £ 
"Wi * © x 
of those interested in the computational techniques. 
‘onsider the quadratic form (54) 
Consider q ged Sig Rn) 1 
; : cS al ог! 
ith the estimated matrix S for the unknown dispersion matrix and obtain the 1 
wi 
i 5:2) 
equations А 8-у = A'S AR. ( i 
А В imate 
Find a solution € and a matrix C of the variances and covariances of the pe té 
according to least squares theory assuming S to be true. The matrix c is the m pa net 
matrix A'S7!A, and is unique if the rank of A is equal to the number of unknow n i cr the 
or à pseudo-inverse (which need not be unique) otherwise. The minimum value 
ion (5:1) is given b 5:3) 
expression (5-1) is giv y "— ( қ 
; ; 5s ferre 
The minimum value (5-3), ©, a solution of the normal equations, and с, which will be ont 
to as a least squares variance-covariance matrix, are used in obtaining confidence i 
and testing hypotheses as shown in the following illustrative example. | -— 
The example refers to observations on growth of ten rats under a contro | рш” 
considered in an earlier paper (Rao, 1958). The averages of 10 values representing ir 
growth during 4 successive weeks are 
Yo = 245, y, = 27:5, y,—241, уз = 30-5 


А d from 
and the estimated dispersion matrix S, within treatments, based on 24 р.ғ. (poole 
other treatment groups also) is 


24-2625 1:7708 — 2.3125 — 3:1083 
25-3750 26-1042 14-3542 

43-6125 19-1250 

35:5417 


Inthenotationof$1,e = 10and Ј 


omia! 
— 24. Let usexamine whether a second degree poly! 
in time can be fitted to the first 


the? 
differences of the growth curve or in other words whe 5.4) 
E(y) = ®+#%+у (t= 0,1,2,3) 

80 that r = 


effi" 
4—3 = 1. The unknown parameters are о, # and y and the matrix А of co 
cients is 


, 


100 
ІІІ 
954 
139 


corresponding to the expectations for t = 0, 1, 2,3. 
The first step is to obtain the normal equations 


ої 
,wi 28}! 
» by the square root ог Gauss-Doolittle method (Dwyer 
Rao, 1955). 
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Table 1. T'riangular reduction of matrix S, with y and А appended, 
for obtaining normal equations (square root method) 


| 

Row | 

no. Matrix S y Matrix A 

(0) (1) (2) (3) (4) (5) (6) (7) (8) 
— 

0-1 | 24.2625 1.7708 —2:3125 —3-1083 24-5 1 0 0 

0-2 — 25-3750 26-1042 14-3542 97-5 1 1 1 

og — — 436125 191250 |241 1 2 p 

04 = cs — 35-5417 |305 1 3 9 

Ll| 4925698 0.359502 —0-469477 0-631037 | 4-973914 | 0-203017 0 0 

12 — 5.024516 — 5228957 2901983 | 5-117282 | 0-184498 0-199024 0-199024 
1-3 — = 4-006257 — 0-912171 |-0:080604 | 0-032591 0-239451 0-738670 
l4 5.088214 | 3-706094 | 0-110642 0-433160 1-522860 


The rows (0- 1) to (0-4) contain the matrix S, the vector y and the matrix A. We consider 
the entire matria which is now in 8 columns and rows (0-1) to (0-4) and reduce it step by step 
by the Square root method till the matrix S is reduced to the diagonal form. In this case only 
4 Steps are needed. Тһе figures in columns (5) to (8) in the rows (1-1) to (1-4) containing the 
reduced matrices are used for further computations. We define the product of two columns 
as the sum of products of corresponding elements taken from rows (1-1) to (1-4). Thus 
(5, 7] = (4-973914) (0) + (5-117282) (0-199024) + ( — 0-080604) (0-239431) 

+ (3-706994) (0-433160) = 2-604883, 
[5,5] = (4973914)? +... + (3706994)? = 64674697. 
The nor i 
mal equations are a[6, 6] 216,7] 6,8] = [6,5] 
ат, 6] + AL, 7] 717, 8] = 17,5), 
als, 6] + ALS, 7] + 18. 8] = |8, 51. 

Computing these products and writing the equations in a form suitable for further com- 

Putations we have 


" в Y 9 
2-361459 
| реет 0-229286 
0-088560 M 0-876129 2.604883 
-— 2-904349 6-604173 


Where Ovwith dementa 0.,0,0 stands for the products of column (5) containing the vector 
: 3 Vp Vy? В ісі 
М the others, The reciprocal of the left-hand side matrix 18 


22-867188 — 96-251429 6:113767 
С = 79:444013 — 21:892726 
р 6.465838 
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and by multiplying the rows of this matrix with the column О we obtain estimates of x, J, у as 
= 25.994127, [= 0-368153, 5 = 0-110708. 
Тһе elements of the matrix C represent the variances and covariances of the esti = месе 
when $ is the true dispersion matrix. They have to be used slightly differently roth me: 
the precision of the estimates in the present case. The miniy m valne of (y — Ax) n тет. 
із(5,51-20,-/0; —9Q, = 1-6005. If the object is to obtain this minimum value only se 
is not necessary to obtain explicitly the solutions of the normal equations. The tr iang s 
reduction of the normal equations, say by the square root method, gives three values in E 
column for Q, the sum of squares of which is equal to the expression 2Q, + 0 nona 
estimates of parameters and their precisions are also required the method suggested ab 
may be followed. а 
Tes Jor specification. In the present problem, it is specified that the average ne 

growth curve of the population of rats is a second degree polynomial in time. To test wh 

this specification is adequate or not, we use the statistic (2.5, 2-9) 


(f+1—ne min (y — Ал)’ 5-0(у — Ax), 
rf т 


із > present 
which is a variance ratio with degrees of freedom r — 1 and (f+1—r) = 24 in the pr 
problem. Тһе value of the statistic 


2410 5) = . 5 e М 

2 yr (1-6005) = 16-005 

exceeds 4-25, the 5% point of F on 1 an 

cation at that level of probability. the 
It may be observed that if the object is only to test the adequacy of specification (5-4), 

alternative method of com 


in $2 
puting the matrix B and the quadratic form (2-4) suggested in $ 
would be simpler in the present case. The matrix A' is 


қ specifi- 
d 24 p.r., thus indicating inadequacy of the spec 


1 111 
0123 
0149 


. . i is 
and by inspection we observe that the matrix with rank r = 1, which is orthogonal to А 


B-(-1 3 -3 1) 
from which we compute 


а = (—1) yo+ 3y, — Зи. + y, = 16-2, 
BSB' = 163-9081. 


Since there is only one linear function 21, 


an 
refer 


c 
the appropriate statistic is Student’s t. We 


» it does not provide 


u 
nstants, which eded in 3€" 4 
practice. But if the sole object і ch may be ne 
method may be simpler. 


NT . + | 
Precision of estimates. In view of the fact that the assumed specification is not found ut 
adequate by the above test the problem of providing estimates of the parameters % p. 


MMÁ 
„— - 
————s à 
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their isi 
precisions H 
s does ris J 
Риты den ise. We need to look for an alternative specification which will b 
2 ata. Ё 5 1 1 
Кет sedin tho fel] s ut for purposes of illustration the computations laid out eames 
a meaning only wl owing, bearing in mind that the estimates and their precisions ac i 
g mee : Ж. ape 
The Mis теңі when the assigned specification is not rejected by a test 5 
ice interval for a par i i 
зат: у is gi 
a parametric function 2t, 7; + --- + Uy Tm is given by the formula 


(4:3). ; 
). Obtain the least squares estimate 
d = 47 Teck t T 


and le 
ast Squan : 
ares vari = i i i 
ariance, g = u'Cu. The confidence interval is 


ЕНЕ 


Where e 
T- j min (y — Ат) S (y – Ат). 


al for the parameter 215 desired the formula is 


In the ^ 
present example, if a confidence interv 
" (/—1)е ү” 
&tll—5 ý 
(a tT) 


nent in the matrix C, is the 16 
e have the limits 


ast squares variance of the 


Where e 
1» 
25-9941 + 4-1263, corre- 


өзінді m first diagonal ele: 
о. Substituting the observed values W 


8 : 
Ponding to 959 
specification to be true, let 


us Consi 

sider : : 

alues osi ш problem of obtaining confidence bands to the polynomial a + ft 4- yt? (for all 
ree-di the time). Although vectors of the form (1,0, #2) do not cover all the points of 
P, vis = ensional space the probability statement (4-9) applicable to all combinations of 
alue is ill true as an inequality. For an ar f timet the estimated polynomial 

a+ pt yt? = 25-9941 + 0-3652/-- 0:110712 


о probability. 


Сопу 
fidence ; 
€nce bands to the differential growth curve. Assuming the 


bitrary value о 


and t 
һе 
least squares variance of the estimate is the product 
1 z 
(1,4,8) O| | = 22-8672 — 52.5029 + 91.6715 — 4378551 + 646581. 
12 


The 
otl 
her values needed for an application of (4:10) are 


k=3, f=24, e= 10, 1+7. = 1.6669, Ж, 
owing confidence li 


2307 а = 0:05. 


mits: 


Sup 
Stituti 
ting ¢ = 0, 1, 2, 3 we obtain the foll 
120 еі 4-2 pes 
| зек md 
19-0383 20-6107 
Lower limit 19-6600 pel 27.1731 28:0947 
Mean value 25-9941 26-472 353079 35-5187 
32.3282 33-0581 


Upper limit 


ned by graduation (such as free 
e specification 


t can be obtai 
ands are Và 
liminary testis 


lid only when thi 


Fr 
om t 
han b values the limits to other values of 
Ado oothing) if necessary. The confidence b 
7 necessary. 


ed is Supported by the data, for which а pre 
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TESTS OF SIGNIFICANCE ІМ CANONICAL ANALYSIS 
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SUMM = 
Rim or Y. Results aro obtained concerning the approximate distribution of canonical correlation 
nis. A slight improvement is suggested for the test of significance of residual canonical roots. 


1. INTRODUCTION 
ail the approximate distribution of 
he tests of significance associated with them. In 


атеш; йн 
particular we shall diseuss the approximate test of significance proposed by Bartlett (1938, 
let us say, the first Ё roots, corresponding to real 


utm ipm canonical roots when. np ы 

rather na etween bwo sets of variates, have been remove a с а qum " ee some 

disp. N us and laborious algebra, but we have been una в o fin any У тия more 

Presumed 5 attempt has been made to justify s expansions in series; they are 
o be valid in an asymptotic sense. 


In thi 
u » B H H H 
8 paper our object is to investigate in some det 


canoni Е 
ical correlation coefficients and t 


the variou: 


L 2. ASSUMPTIONS AND NOTATION 
Em sets of variates containing respectively P and q xp м by 
ну жү and wy, the corresponding row vectors being a; and жә. iun | эз зө edin т 
Чоп ii relationships between х; and %2, and we shall begin by consi lering | e true correla- 
"1 case where all variates follow 8 multivariate normal distribution with zero means. 
‘thout loss of generality, since at most only а linear transformation within each set is 

А 


Тест ^ А . : 
quired, we may assume that their variances and covariances are given in matrix form by 


Бауш) = Blea) = fl s 


Еа”) = Р, Е(®®) = Р, 
d where P = [ра matrix with p rows and 


that ри = Pi is the ith canonical correlation 
и 


Whe 
ere Z, denotes the unit matrix of order p àn 
hat the first (<, < 9) of the р; are 


4 со 
Coe lumns such that p;; - 0 НЕНІ d such 
isting s nt in order of magnitude. We shall assume | 
: iti i ero. - WES А 
апа positive and that the ane Pe т ST "T VE PM 


о К 
"responding to ће above true values freedom, ате given қ 
© variances and covariances, with n degrees of freedom; x 
cox А» = 7. Xa Е Q 
Аһ = е Ө м = ү = Р'+Хәг 
= 2 T Р 
Thy e Pon 1 estimates from expectation. 
S the X-matrices represent deviations of so relation coefficients are the latent roots 
š * i с 
ft 98quares 72, ..., 72, ofthe estimated canonica 
A(Pa Xu) (2) 


P +a) E+ xu 


tically Zero- The usu 


? лаж Қ 
тер < Ф the last q— of these roots BT” ide jn their population та: 
«de and that with high pro 


Зата 

y ts W1 

hows, to the correspondence of these 09 = deam 
ег, that they are arranged in order О 


© matrix 
al difficulty arises with 


lues. We shall assume, 
bability r? then 


А Ag Ant А = (1+) 
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; ; — 
ds with p for i = 1,2 k. It would be difficult to investigate the point w den 
à e og: Р бад ВИ 2 
[uie элш but it seems likely that, roughly speaking, we are justified in this ass ot 
а that the sample size is reasonably large and that the first X of the p; are ne 
ion 
very near one another nor small. 


Let us now expand the matrix product (2) in the form 
U- Xa Xb- (P+ Xa) -Ent XS.) (P4 X3) = РР+У, 


5 
nd ix Y тау be expressed & 
where Р’Р is a diagonal matrix with elements p2, sp The matrix Y may be expr 


У = YO: yey У®+..., 
where YO = XI — PP) - Xs, | (3) 
rYo- -Xoll — P'P)4- X Xs t+ Xa X; J 
рғы TE —P'P)— Xh Xs — Xu Xa X, Xa Xa Xi ete., 
Xs = Xg-P'Xy4— X4 P4 P'X4P, 
Ха: = Хв = Хн-Р’Х и. 


Е satas given DY 
Tn order to interpret the above notation let us introduce a vector 2; of variates give 


and where 


S3 = %—P'ay. 
Thus x, represents the component of x, 


d 
P variances aD 
which is independent of >. The variance 
covariances involving x, are then given by 


E(z,25) = E(z422) = I- P, 
(аз) = 0 of 
; я M А jations 
The matrices X, and X. з are to be interpreted as representing corresponding deviatio. 
sample estimates from expectation. 


(4) 


matrices. The latter can be — is 
and by the use of the relations (1) and (4). The alge? joal 
so we shall not reproduce it in full. Let us denote the tyP 
Y Уу and yf respectively (5j = 1,2...,q). Then 
ы = yD + y2 ruo 
and we find, first, that об дн 


Elya) = (1-р) (p— 9 — pin + O(n), 
where 


k 
dic Жр]. 
For i + j we have Ву,;) = 0. = 


Proceeding further, for i +j, 


EUY jo) = (1~pi) (1—03) (gà t pj)ln, 
Pw YR) = -0 -pl) 1. — p3) {q+ 2) pi рп, 
Буур) = (1-22) (1-0) la — pa, + 1) pic( 
EYP y®) = (1 — pi) 0 — pl) p — 22, a,) + ( 
where dy = Soh, 


8-1 


we have 


2 
a= DPI PE + ph + 22031" 2 
“4-26 + p?) + 2(p4 + pt + 0203) 
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We hence find that 
Е(у,у;; == AB 2 PEN 
Узун) p) (1— р) = (p + p3)[n + ((p — 2a, + а.) + (3a, — 2p +4) (p? +p?) 
ir, 
T— +2(2p4 + 2p} + 3рїрў)}/п* + O(n). 
na similar manner we obtain the following results: " EA 
var (y,.) = 4p? 2 2 
(у) = 4p3(1 рт (1-18) (q —a,)[n (1-2) (p — q — 2a, + 4a) 
| +2(ба, — 4p — 11) p+ 39pf/n* + O(n* 
Sd = 24pi(1— pi (1 — 3p3)/n®? + O(n-), | " iie 
AI (у) = 96p3(1 — рф (1 — 293) (2 — 1309/1? + O(N). 
80, for i 4: j, we have 
var (gy..) = о\ 7-2 2 2,2 
oe = (1— pj) (93+ pj — pipi m +O), 
Ун) = (1-03 2 5(p2-+ р? 5р? pn? 
60) = (1-23) (1 ~ p3) (— Бор ++ рў) + St pn + Om), 


COV (y; 242 2 д,2 2 о 
Uo Uad = (1-9 (1-2) (1+ BP P7 8pi — pi — 9pipj)[n* — (1 — pi) (1 —0*]n* 
3 O(n3). 


RIBUTION OF THE FIRST k ROOTS 


ME use of an expression for the first k latent roots of the matrix P’P + Y. A similar 
the intor was derived in a previous paper (Lawley, 1956a) in which we were concerned with 
aiv ernal analysis of a single set of variates. The sth root of the matrix (s = 1,2,..., k) is 
en by 
8 = р? vv aiis y yy Ун + Y и — Oy" 

—— = кү; He (а-а Р-Р) ies 
of the " indicates summation over all values from 1 toq with the exception of s. The validity 
Bre be. mi depends upon the assumption that the y-variates, which are of order 1/A/n, 

all compared with the quantities (2-0 } : 
val means of this expansion and of the results of the preceding section we are able to 

ate the first four cumulants of 7$ and hence those of r, We find that 


Р қ Ta: E 
3p,E(r,—p,) = (1-28) р+4-8-0+21- 2 КОД ) 


БЕНЕН, a 
ky) = = 66.0 — p?) m? + 0097 ) 
= Led po (1-1208 +0097) 

on to the variance of ,, since the 


depends to à considerable extent 


4. APPROXIMATE DIST 


Tn ку» 
general, it seems useless to seek а better approximati 
The variance 


erm 
Чо de order n~ is extremely cumbersome. wp fam Eod Әшен 
aly on the value of p, but also on the differences ps — Pi 18,21 > сопа @- 


і à Ў % 
е А ; „zero canonical correlation coefficient. 
case where № = 1, i.e. where р; 18 the only non-2! 


Ore what restrictive assum tion 

general Е sider 7 under the some Шр 

at the first 2 > т. ES 1, A pon F is moderate. For these cases, writing p in 
— 1 of the p; ar 


се ar unity while Px 
of Pr ме hase 

E(ry) x p+ 30-0) (mp Р)! rut 

Var * R (і ipa +1 py 1109—20" 1) трт", 
b] ** 2 

lti т = p+47 2. uu 
оғ 15 natural to apply the usual z-transformation in ке 
9 canonical correlation coefficients; put in genera 


Where 
malize the distributions 


der to nor: 
f little help 


his is unfortunately © 


Tests of significance in canonical analysis 
64 , 


<pecta fa f e t -2 о ted We 
in p е: 1 2a лу be neglected. 
e i | order less than n? and m: № 5 
і tions of all other terms are о 
since the ex 
find that 


E(tr УФ) = (0—0) {(q—k) (p 2a, a4) + (p-a) (p a, + A 
pe as шш иона жу уте, 
Е (Yo, D3Y,,*) = (q— E) ((q— E) (E— 2а, а) (6а) (Еа 3-1) 
and hence that E(tr 22) -(р-2) (4-1) (p--q—23k4-1 )/n?. " 
Тһе expectations of tr 73, tr Z^, etc., are all of order less than n~ and may be перес 
Hence, finally the expectation of (7) is found to be 


(p — к) (q — k) (1n 4- (p+q+2k—2b+ 1)/n?} + O(n-9). me" 
inlving factor 
Thus, supposing that the values of pi, ..., py were known, the correct multiplying fa 
the criterion (6) would be 


a 
j= 


n-k-Mpq41)4. S (1/pj). 
ae yqq- r) + O(n) 
This would not only make the expectation of the criterion equal to (p — E) (q var 
but would make the other moments equal to those of y? apart from errors of order 
à result established elsewhere (Lawley, 19260) 
Ifp,, ..., py areall fairly near unity, the multiplyir 
which is the value suggested by Bar 


be obtained by taking the factor as 


pq t 
ng factor isapproximately n — (p +4 


ly 
"ен 1mab б 
tlett. Otherwise an improvement should, prest 


k (9 
®—Ё—+(р+@+ 0+5, (1/r$), 
ігі 


though we cannot claim this to be very firmly established. 


6. THE CASE or FIXED INDEP РР 
т 
Let us now Suppose that x, independent variates an 
dependent variates. Then the c i 
® оп 21. For given т 


ENDENT VARIATES 


и = Pi, where Pigre 
remaining fj; are Zero. We 


I and OF 
(with n degrees of freedom) ces which we denote by C. баз On? 
It will be convenient to consider the matrix = 


(9) 

(Cop — Cy, OF} Ci) 0, Ci Os, 

Which has for its latent Toots the quantities R= 71). Where { = 1,2 q. the P 
Estimates, with m = ^ — p degrees of freedom, of the variances - covariances ef 

(for given ж.) are Provided by the elements of the 


Matrix 
1 


im (Ca Cun CFCs) = үү, 
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whi қ we ; 
т elements of V represent deviations of these estimates from expectation. To 
6 P ify matters we may choose the x, again without loss of generality, in such a way that 
и = n, Then we may write 

1 

Zo CFOs = ВХ, 

т 12 и 12 
resent deviations of estimated regression coeffi- 
f pg independent normal variates each having 
ibuted independently of the elements of V. 


wl 
vg the elements of the p x q matrix X rep 
ses Sfrom expectation. They constitute a set o 
ера and variance 1/m and they are distr! 
9 matrix given by (9) may now be expressed in the form 
(1+ уве X (Be X) = U-V& V- (BX) BX) = В'В+Ү, 


w „э: iaa 
here B'Bisa diagonal matrix with elements / (4 = 1,2, .... d). The first & roots of (9) 
Provide estimates of f, ..., ffi, while the remainder enable us to test the hypothesis that 
кер ...,Д, are all zero. The matrix У may be expanded as before in the form 
y= YO yor Y9-... 


b 

ut now ya = BIX+X'B- VB'B, 
ye = Х'Х- VEX-VXBt V2B'B, 

yo = —VY®, 

yo = ҮЗУ, ete. 

of the preceding section. Then the q — k 


Let 7 
US partition Y int с manner as the P о | 
Smallest roots of ре E coe approximately the same as those of the matrix Z, where Z is as 


oe 2 à 
Th өп by (5), provided that we now define D as the diagonal ma = Rm s i 
latent roots ofI--Zare approximately equal to the quantities Lenis Б 

a 2 
nce the criterion (6) may in this case be expressed as 


3) — 
ав (22) +6609) 
log, 1+2] = (0-3 кет to recaleulate th 
"a Order to evaluate the expectation of this expression de : и a тает 
. Peetationg of the various products of elements 01 > ғ q) and neglecting quantities of 


= 0(k <8 $ 
order, Process. Assuming throughout that дейі 


717? we find t т? 
nd that йыз pim+plat 1m , 
By) = (PtP TUS 


(10) 


№ k o 
here " = X 
Тор; | oq +2) т» 
i (pM * 
8 we have Ералы) m film {P а 


Byatt En +4)+ (q+ 
Ву) = 2)/m? 
SA а ИУ 3+ fl 2 
Ву) = А " $ Віт? 
ang E(ys uidi) = 77" 

> for %4 


Ем MM 


s all unequal, М 2 g2)m?, 
НИЙ 


E(ys a i9 gs fe EDI 
(ухи) 7 Кү A жа 
Буы 1" 
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тейен E(tr Yoo) = p(q—k)|m+ p(q— k) (q+ 1)/m’, А 
E(tr У DY) = (9 — k)[m 4 (q — k) (pd + k(p + 2q + 2)}/m?, 


2 
12 
2 


Е( YoY DY o) = (q— К) ((p--q — k-- 1) d - kj[m?, А 
Е(ыҮ D“Y,, D31Y3,) = (q— k) ((p + 25 4- 2) d - k(q 4- 2c + 2)|m?, 
Вы, DY, D, D3Y,)) = (1-0) (6+1) (e+ 2d) т», 
E(tr Yo, DY, Ya, DY) = (q — E) ((q - Id + Ij[m?, 
k 
where d- ха 163). 
j= 
Hence E(tr Z) = (p—k) (q — k) (Am -- (q — k - 1— d)[m?j. 
We have also E(tr УБ) = (q— E) (p(p--q —k4- 1) + дт, 


E(trY, Y, рУ) = (q — k) {klp +g -k+ 1) + oj[m?, 
E(tr (Yo, рУ, )2) = (q— k) {klg + 1) + ет; 
E(tr Z?) = (p— k) (q — k) (p +q — 2k + 1)[m?. 
Hence, finally, the expectation of (10) is found to be 
(p-k) (q — E) {1/m+4(q—p —2d + 1)/m}+0(m—=) 2i 
= (p-k) (q- E) (1n 4(p+q—2d+ 1) т) + O(n), 


and thus 


putting m = n— p. 
The appropriate multiplying factor for the criterion (6) is 


k " 
п-$(р+9+1) + (1/47. 
j= 
e 
If A, ..., В, are all large the last term may be neglected. If, on the other hand, we теріво ў 
each f by its estimate "j| (1 — 72), the multiplying factor will again begiven by expression( 
We are thus led to much the same conclusion as in the true correlation case. 
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THE ESTIMATION OF PARAMETERS IN SYSTEMS 
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1. SUMMARY 


Іп тапу 
папу fields of study there occur systems of lagged dependences which can be described 


approxi ; : à dm 
Ps oat by systems of linear stationary stochastic differential equations. In this 
toti те consider the estimation of parameters in such systems, using an approach similar 
m : of Bartlett (1946, 1955) and Quenouille (1957), and discuss conditions for identifica- 
sam E the separate behaviour relationships of a system. Little is known about the small- 
unice properties of estimates in systems of this kind, but sampling experiments are being 
out and the results will be published later. 

EAR STOCHASTIC LAGGED DEPENDENCES 
tween n variables 2:1) which are 
tion integrals 


2. SYSTEMS OF LIN 


Ав 
st : i 
ба ‘tem of linear stochastic lagged depen 
1 : P М 
ons of continuous time can be described 


dences be 
by the system of convolu 
c(t) = X | 2000) ay — 9) dha£( (G7 b? wong) (1) 

іп wh; j*iJ0 | 
ion each of the weighting functions 10,07) specifies the magnitude of the dependenoe of 
value taken by 2; at time ¢ on the values taken by 27 at times #—Ё, and the 5,0) are 
4 hting function of a lagged 


Stochasti " i 
іс disturbance t Laplace transform of the welg 
e terms. The Lap Tf all the transfer functions 


e . 
on is called the transfer function of the dependence. 
system (1) are rational functions, ie. if 
2 pils) (2) 
| g^ wi; dh = PC 
W] | | 
Ex е Buls) and (в) are polynomials of finite degree In 8» (1) can be written as the system 
fferential equations 
(5j = 1,2,..- 7) (8) 


200) = У Bis) +50) 
gut a;(D) Қ” ан al 
the ith equation 1n (1) is then а. particular 
jnitial conditions are at 


in Which 
: Di . Е СЕ 
is the differential operator ajdt. For Pis stable an d the 


Int 
в and of the ith equation in (3), and if the system 185 (8 
790 it is the general solution of the ith equation In V7, 
b 
аш QUD) be an n х п matrix whose elements 9,40) are given DY 
E if jti " 
(D) = Qij АЕ 
040) i T 
а ; 
Nd let z(t) and £(t) be column vectors with elements 200 and g(t). Then (3) may be written 
(5) 


др) = 80 - 
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fields of study a system like (3) may be used to represent the аша 
= (е ate units which form an interdependent system. Thus the equations в: 
siraan Shay шр of units of physical equipment, as in control engineering, or м 
je pr: coe or firms, as in economics, or of animals or organisms, as in “a : 
iological с ерең In describing real behaviour systems of this sort it is often apu = 
assume that the behaviour units cannot respond to change instantaneously. I С саат 
lenttotheassumption that the weighting functions w,,(h) in (1) do not contain de evt e 
impulse terms,* which in turn implies that the numerator polynomial Fm he 
transfer function is of lower degree than the denominator polynomial aus). In t pi- л 
we shall consider опу systems in which the numerator of each transfer f unction и е Е 
degree than the denominator. It will be seen later that this restriction play á E 
portant part in the identification of the separate behaviour equations of a system. 


3. SPECIFICATION OF THE STOCHASTIC DISTURBANCES £(/) 


| А Рта ; E(t) are 
We shall consider only systems which are stable and in which the disturbances al ie 
stationary stochastic variables whose covariance functions possess rational tv 


Laplace transforms, It will be convenient to assume that the disturbance 


s E(t) are generate 
from another set of stationary stoch 


astic variables ¢,(t) by the transformation 


п fo | (6) 
E(t)= 5 | rill) GU A)dh (6-19, п) 
k=1 J0 

in which the functions Tin(h) have Laplace transforms Fs) of the form 

7) 
7) 45) ( 
Rus) = a 
ale) 19) 


where 77,,(s) and Yas) 


hat of 
Vix(8). Equations (6) 


аге polynomials in s, the degree of "a 


s) being lower than tl 
can then be written as the set of differen: 


tial equations 
E(t) = R(D) C) 
р: 
where R(D) is a matrix with elements E; (D) and C(t) is a column vector with elements ы ps 
In order that the variables 5(#) be stationary it is necessary that all the roots of the equ? 
Yu(s) = 0 have negative real parts. 


(9 


, 


We define the following covariance functions; 


(т) = ВЕ т}, (0) 
V) = EKOE E-r), 
Ат) = ВА ет) 


* The 6-function or unit impulse function, ó(h) 


- 0 if kh +0 © 
8(Ь) = йз: ЖЕ nio апа ЕСІ & 1], 


is defined by the equations 


(See James, Nichols & Phi 
p mp ia p (8) the partial fraction ехрапвіо Я 
Pas) а4/(8) wou со у and so its inverse transf h) would cont 
6-function impulse term. orm w;;(h) 
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and de , i S 
note their Laplace transforms by 6(s), ¥(s) and A(s). Th 69 
, . Thus 


As) = jJ етт) dr, 


Ч ($) = P ST) 
(s) | ет) dr, 4% 


-шо 


A(s) = | © e-stA(r) dr. 


J—o 


Tt wi 
will now be shown that 0(s) is given by 

Ols) = R(s)Y (s) В'( —5). 
efined but zero for h < 0, and the lower limit 
then becomes 


(11) 


The fu а 
of Jem ra (h) in (6) may be regarded as d 
ion extended to — o0. In matrix notation (6) 


0) = І hy e(t—h) dh, 
multiplying (12) by £'(t—7) and taking 


(12) 


(Ж) bei 
n қ 
береді the matrix with elements 7.0). Post- 
ons we obtain using (9), 


Mr) = | = АЮ. 


-0 


(13) 


Takir 

hg t 

$ the Laplace transform of (13) and remembering that the transform of the con- 
forms* we have 


Vo. ution 
of two functions is the product of their trans 
Also бө) = Ris) M9). (14) 
> Postmultiplying (12) by £'(t4- 7) and taking expectations we find, using (9) 


ae) | mre ens as) 


On 
Maki 
ing the substitution 44 = — h this becomes 
(16) 


A'(r) = L =) рт) 
and it is 
зь follows that A'() = Rl —s) ¥"(s)- (17) 
Stitut; » 
" ami the transpose of (17) in (14) we obtain (11). — 
Ow postulate that W(s) be identically the unit matrix, 
Subst у(в) = 1. Ро 
Stitut; 
ution of (18) in (11) then gives 
(19) 


o(s) = R(s) R79) 
ons of a set of variables can be 


al density functi 
of their covariance functions by replacing 


for the theorem & 


he conditions : 
vem ан the roots of the equations ya (8) 


Si 
Nee t 
bt hes Spectral density and 
from the two-sided La 


cross-spectr 
place transforms 
re satisfied as а result 


- 0 have negative 


ж 
8 
of ү, ee үү; 
th idder (1946), p. 258, theore 


Pen; © Stat: 
al at; s 
рагы arity of (4) and the cond 
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the complex number s by the pure imaginary iw (the Laplace transform then d 
a Fourier transform), the spectral densities of the variables €(¢) аде ЗОН Or 5 dea 
quencies w and their cross-spectral densities are identically zero. The matrix ve) o dida 
covariance functions is diagonal, the leading elements being 6-function impulses. warte Е 
implies that the variances are infinite, the variables С() are not physically шаша "i 
However, (6) remains a valid mathematical model for the generation of the et 
realizable variables £(t), whose spectral density and eross-spectral density functions, given y 


"ee (я of 
(iv), are proper rational functions of iw as a result of the restriction that у;,(8) in (7) be 
lower degree than y,;(s). 


4. COVARIANCE FUNCTIONS OF THE SYSTEM VARIABLES 24) 
Substituting (8) in (5) we have 


(20) 
Q(D) x(t) = R(D)E(). 


Let P(D) be a diagonal matrix whose ith leading element is the lowest common multiple of 
the denominators of the elements in the ith row of Q(D) and in the ith row of R(D), 2-6: " 
lowest common multiple of the polynomials a;(D), j + i, in (4) and y4(D), k = 52,7 
in (7), and let (D) and G(D) be matrices defined by 


(21) 
F(D) = P(D) Q(D) 2) 
22 
апа G(D) = P(D) R(D). e 


Then premultiplying (20) by P(D) we obtain 


23) 
FQ)«() = GD) C(). M. 
о 
The elements of F(D) and G(D) are polynomials in D, the diagonal element in any row 0 
F(D) being of higher degree than the non-diagonal elements in the same row and 2180 | 
higher degree than the elements 


in the same row of G(D). Thus the determinant |700) 
cannot vanish identically, and we may write (23) as 


w(t) = F3(D) G(D)C(0. e 

Let Ф(т) be the matrix of covariance functions of the system variables x(t), so that 
Ф(т) = E(x(t)a'(t —)y " 
and let Ф(5) be the Laplace transform of $(т). Then, by a similar argument to that used? 
іы ы Фа) = Fle) 9648) 6-9) РЦ) a 
or, since Y'(s) = J, Ф(5) = F~(s) G(s) G'(—s) Е’ (-5). d 

If we define the polynomial p(s) by 
p(s) = |F(s)] 

= По-а) e 


* In engineering literature the variables C(t) 
Laning & Battin (1956), Б 


ge? 
ате called j 
pp. 136-44 and Miller (19 E 


pendent white noise source? 
56), pp. 282-4. 


= A. W. PHILLIPS 
(в) for the adjoint of F(s), (27) becomes 


Fe(s) G(s) @'(—s) F'«(—5) 
р(в)р(—5) à (39) 


Ф(ѕ) = 
Forsi i 
mplicit Ww i à 
of the equati à : nt Ue consider in this paper only cases in which the roots 2,6-1,9 
= 0 are all distinct, and shall also assume that p(s) does rota ite с. 
ntain any 


factor whi 
which is со 
s common to all the elements of (8). Expanding the right-hand side of (29) i 
) 29)in 


Partial fracti 
al fractions we then obtain 


Ф($) = Ў EUM X т а 
r=1 87 r=1 -s-AS pu 
Where 
к. = Coad PG) Gl) @'(—s) F'(—5) 
ПƏТЕР) ИТ. p 


For asi 
simpl š 
ple root À, the matrix F(A,) is necessarily simply degenerate. The adjoint 


Matrix Га A 
(A,) is of unit rank and can be written 


Where ie. j Рч(А,) = „кь PS 
„18 
» is a column vector which satisfies the equation 
and к. is F(A) k, = 9 
718 а row vector which satisfies 
(See Pray, к„Е(А,) = 0. "s 
[3 
r, Dunean & Collar (1950) 5-6.) Let l, be a row vector defined by 
s— A), G() (= E 
т ЖЕГЕНІ 47-9 din 
hen subst; р(в)р(—5) ші 
ituting (32) in (31) and using (38) we have 
(36) 


Thus th K, = hls 
e ә 
from 85 мк К, is of unit г ank, with columns proportional to the vector k,. It follows 
шады, FOE =0 =? от), uy 
ultiplying (31) by F(s) and putting 


а reg 
ult whi 
“= . hich could also have been obtained by prem 
is obtained by taking the inverse 


netions 1 
unique inverse. However, for à stationary 
real parts and ф(т) > 0287? + оо. It сап 
nsistent with these conditions is that 
nsform of Ф(т) forr > бапа 


-sided tra 
0. The inverse of the first sum in (30) 


The 

e 5 

trensform of ¢ P(r) of the system covariance fu 
i. em the р, (30). Formally, (30) does not have à 
gh ily be ai А, are negative ог have negative 
fh, tined b erified that the only inverse which is co 
the Bond. taking the first sum in (30) to be the one 
9n gi sum to be the one-sided transform for T < 


Siveg 
g(r) = К 


r=1 
we see that ф(-7) 


(т > 0) (98) 
ф'(т), а fact which is also 


hile 

oh. fro : 

b Ous m the inverse of the second sum 
from (25). 
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5. CALCULATION OF @(s) FROM GIVEN POINTS ON ф(т) 


ВИРА » T" А of ron the 
Let ¢, be the matrix of discrete functions consisting of points at unit intervals 

p + 
ee functions Ф(т), i.e. let 


39) 
ф,-Фіт) for (r2.., 2, 1,0,1,2,...), ( 
and let M(z) be the one-sided generating function defined by 


(40) 
M(z) = у ф,2". 
т=0 
Then from (38), (39) and (40) we find that Е 
К, 
Ма) = У ———, 
z11—44,2 
r=1 Г (42) 
where Hr = e. 
m (43) 
We define the polynomial A(z) — I (1—4,2), 


and seek matrices U(z) and V (z), whose elements are polynomials in z, such that " 
М(г) = U-\(z) V (z). 
If (44) is to hold we require that 


(45) 
|U(z)| = A(z) 
п K aa 
and that Uto) 2; 1-е = Ү(г) 
Multiplying (46) by 1— Haz же obtain 
4T 
K ( 
Ut) [K,«ü-a,3x та |= а-ар) 
rq 1-/,2 
and on putting 2 = 1/д 4 = 1,2, ...,m, this gives 
(48) 
v (7x, -0 (¢=1,2,...,m). 
Иа 


Since K, is of unit rank, 


oly” 
(48) provides m scalar equations in the coefficients of the P 
nomials occurring in each 


row of U(z). If U(z) is given the form 


О(г) = I--U,z4- Ог... +. Ц, =, 

where U,, Uj, ..., О, are nxn matrices w 
contain cn coefficients of powers of z ( 
Tt can be shown that if c is givenas 
are put equal to zero, equations (48) 
row of U(z), provided the positions 
degree m. It will always be possible 
satisfied, and equation (45) will the: 
usually, but not always, be possible to 
of the coefficients is put equal to zero 


hose elements are constants, each row of Fm 

excluding the constant terms, which are unity 07 (4 
uitable value and си — т coefficients in each row 9 е ob 
suffice to determine the remaining m coefficients 2 і of 
of the zeros are chosen in such a way that | ДНА p 
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When U( 
z) has be i 
iier en determined Т(2 
erit ied ‹ (2) can be found fr 
я -à ‹ om (46). It ca: 
S ne ie : (=) K, and it follows from this and (46) eum the esa Pod doa ieee 
of U(z). Thus V(z) is of the form uoc 


where И, y, Viz) = Nether. +1021, 

for Vi, sss V у i i 
Posh dee 1 T (opi whose elements are constants. С 

ying (44) by U(z) and writing the matrices in expanded form, we 1 

,we have 
(1+0, 2 
Bison +...4 0,2) (ф+@,2+.-.) = ИИ... that (51) 
ле с ісі н | | 

5 oefficients of 22, 2^*!, ...,in (51) we obtain the difference equation 

(82) 


p FU ate TUS. -0 (rz с), 


from 
Which the i 
matrices U,, Up, ..., U can be calculated if ф, is given for а sufficient number 


of valu 
es of » 
7. Equating the constant terms in (51) we have 
W-$ 
and equati TO E 
quating the coefficients of z, 28, ..., 2271576 find 
(54) 


1 
i= die Х байы ü= d Sot 


from 

whic А 

; Thus сеа ай И, И, -ees Vo- 081 be calculated. 
ünetiong din) : sufficient number of points at equal intervals on the system covariance 
is possible to calculate Га) ала V (2). M (z) is then given by (44), and can be 


*Xpanded i 
ed in partial fi 2 
partial fractions to give (41). from which the matrices K, and the scalars д, are 
p(s) = 0 сап then be calculated from 


Obtain 
ed im { 
mediately. The roots A, of the equation 
(55) 


A, = 06» 


then be found from the K, and A, using (30). 


Which f 
oll 
ows from (42), and the matrix Ф(в) can 


WHEN ALL EQUATIONS 


p 0(8) FROM Ф(в) 
ARE OF THE SAME ORDER 

(20) or (23), in which all the scalar equations of 

nd which cannot be reduced to an equivalent 

ments on the leading diagonal of F (s) 

degree of the numerator 


6. 
CALCULATION OF Q(s) AN 


Consiq 

(D) ed ne n variable system defined by 
System ien 5. ) &(t) are of the same order 

are S жарық any equation is of lower or 
TMs of th ials of degree и; but as & result of t n on the 
Clements e transfer functions in the matrix 0(5), discussed in $2 above, the non-diagonal 
P) can are of degree >- 1 at most. The coefficient of s" in each of the diagonal elements of 
Stant te be taken as unity. There are % other coefficients of powers of s (including the con- 
я мна in each of the diagonal elements a ts in each of the non-diagonal 
there In calculating F(s) from Ф(в), or equivalently from the K, and A,, 7 = 1,2; fh 
е therefore nu coefficients to be determin! row of F(s). Since |F (s)| is of 
-homogeneous equations in 


e 
r 
t Bree ny we h б Е 1 $ of nu non 
ет — mu. We obtain from (37) a se MEL ; 
hown that the equations ineach set are linearly 


‚ © сое 
i ffici 
ena of each row of F(a). (оба bes ) 
ent. Thus all the coefficients in F(s) аге uniquely determined. 


aL, а) 
der. The ele 
he restrictio 
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Given F(s) we can find Q(s) from 
Q(s) = P= (s) F(s), (56) 
which follows from (21). It will be noticed that the elements of the diagonal matrix P(s) аге 


identical with the elements on the leading diagonal of F(s). The numerator and denominator 


polynomials of the elements of Q(s) as calculated from (56) may have common factors which 
can be cancelled. 


Given F(s) and the K, and A,, G(s) G'( — s) is readily obtained from the equation 


G(s) G( — s) у Е) K,P'(—s) 5 F(s) К„Е'( —5) (57) 
т=1 8-4, r=1 —s—A, 


which is derived from (27) and (30). Using (19) and (22) we then obtain 
Ө(8) = P-1(s) G(s) G'(—s) Р'-(—). (58) 


The numerator and denominator polynomials of the elements of 0(5) as calculated fro? 
(58) may also have common factors which can be cancelled. 


7. CALCULATION ог Q(s) AND O(s) FROM (D(s) WHEN ALL THE EQUATIONS ARE мот 
OF THE SAME ORDER, BUT THE DISTURBANCES ARE NOT CROSS-CORRELATED 


If all the equations in a system are not of the same order they cannot all be identified unless 
further restrictions are placed on the system. Consider a system (0) a(t) = GD) 0 "i 
Ko € E а are not all of the same order, and in which the arrangement of the equa" 
н сечи — ned equation is of the lowest order and the order of each success” 
T and Gio o и ы gh as that ofthe one preceding it. If (27) is satisfied by the убай y 
an ix €— it is also satisfied by иЁ(з) and 68) where x is an arbitr? d 
arme ^ s. We is add to anyrow of F(s) à multiple of an earlier row correspond! 
Dae иы ih is er without violating the condition that the non-diagonal eleme? 
мез бф» eee than the diagonal elements; but we cannot add & multiple б 
Be aoe E out violating this condition. As a result of this condition, therefor Ж 
тайы: гы E о matrix with zeros above the leading diagonal. Given Ф(5) e 
bue м 4 ыш "s ein is therefore determinate (and so are any other equations oe 
not identified ань oem ien а equations of higher order than the first 009 " 
Е нен У г-ая restrictions on the system. 8) 
жю ПЫ у e disturbances to the system are not cross-correlated, G(s) a Ө 
- If uF (s) and иба) are to be admissible solutions, therefore, ш must gati? 


th icti ; 

= m т that the non-diagonal elements of LG(s) G'(— s) ш be identically at 
Wes MC d жез — is of G(s) G"(—s) the first part of the array formed by г 
"^ e leading diagonal of the matrix product (8) G'( — s) ш iS four 


Pagua 1911031 Puuka 


Pila ИәзбәзЙза Morgi Har + [095g 22//42 


P31 911 Har + Изд 
Ж | 3292242 + 055 933/443 
ce u;; cannot be made zero, we see from the first ele ДА 


" 
Е ish. ment in i + 
w Mors = Е the Second row of the аттау we see that if Sera dim P A rz алб 
Haz sh. Similarly, from the third row, дл, дызай, m © 0 = in * ». E 209% 
аз vanish if 011, 922 
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own that all the elements д; (i > j) must 


ате non i 
apenas e — : z 
ro. Continuing in this way it can be sh 
The matrix jz is then diagonal, 


vanish i 
тт the elements d; (0 = 1, 2; «т 1) are non-zero. 
To d peores in the system are identified. 
И gt (s) for the system we have just been considering, given Ф(в) we first note 
Covariance fun с аар of lowest order are identified without restriction of the disturbance 
ы е-и : ions, it will be possible to find coefficients jn at least one of the rows of F(s) 
The Sous 4- ле appropriate set of scalar equations derived from (37) for all the value of r. 
but er ofi bang will be greater than the number of non-zero coefticients in this row, 
: orm a consistent set with а sufficient number of independent equations to 


determi 

1 1s e 

— ны the non-zero coefficients. In the present case the first equation in the system is of 
rder, so the coefficients in the first row of F(s) can be found in this way. 


From (27) 
27) we hav 

"e F(s) Ф(8) Ё'( —s) = G(s) G'(—8). 
rst column of the matrix product in 


(59) 


Ont К 
he assumption that G(s) G'( —5) is diagonal the fi 


59) yi 
(59) yields the equations 


> X Ев) тӘ -9 20 (= 2,3, т). (60) 
ізі а= 
al values, multiply through by 


their known numeric 
obtain for each value of i а con- 


omials in the ith row of F(s), 
ulated from F(s), and 6(s) is 


If we oi 
4. Ф,(в) and F,,(—s) іп (60) 
—5) and equate the coefficients of each power ofs we 


sistent в : 
1 set of linear equations in the coefficients of the polyn 
can then be cale 


Whicl 
ghost be solved for these coefficients. Q(s) 
s ats) = Q() (8) 9—8. (01) 
Шер 8. STOCHASTIC DIFFERENCE EQUATION ков DISCRETE POINTS ON Чу 
that (2) be the two-sided generating function of the discrete functions ф, defined in (39), so 
‘ rim Ве = 
Mee Й а 
$_, = ф, we have from (40) (63) 
Sub, ге) = М+М (71)- e 
stituti 
ituting (44) in (63) we obtain - 
© rt) = Ue) Vt) V6? gh E: 
wh oe) Tie) uen = "0" (7) PH 
ere W(z) із 3 . tion 
а dix whi fi the equat! 
ny matrix which satisfies Ule) do U'(z). (66) 


+00) 7797 | 
the covariance functions of 


T fol Wee Wer) = Tee igual 
вер, Ows from (65) that Г(2) is also the generating rmm equations 
te variables 2; which are generated py the system 0 (67) 
: gann- WEDA 
о | Я 
m Stationary stochastic variables 6 with covariance functions 
if т=0 (68) 
Efeler = lo # т#0. 
de&ned by Ё 1 = V» ) 


(Th 
e 
Symbol Æ- in (67) is the shift operator 
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i i it time i zals on the 

i i i ified with points at unit time interva + 

i ables z, in (67) may be identifie 5 ; DER 

a MES x(t) на (20), and it follows that the sampling жарасын of сее of 
continuous у 4), E ы idering the sampling propert 

i tudied by considering р G 

tinuous system (20) may be s irvi 

wr rege the Еле system (67). The work done by Bartlett По pm (1949) 

e (1945, 1946, 1949), Quenouille (1947, 1957), Whittle (1952, 1953), V hoe 

d others бй sampling properties and goodness-of-fit tests for eutoregressiv Ae Б; 

и and mixed processes is therefore relevant to the estimation of para 


я E уе been 
systems of stochastic differential equations. Most of the theoretical results which hav 
obtained, however, are valid only for large samples. 


idered іл 
Some sampling experiments are being carried out on systems of the type consic 
this paper. The results will be published later. 
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DESIGN OF EXPERIMENTS IN NON-LINEAR SITUATIONS 


Bv G. E. P. BOX ахр H. L. LUCAS 
Statistical Techniques Research Group, Princeton. University 
and Institute of Statistics, North Carolina State College 
1. OUTLINE OF THE PROBLEM 


We 
supp Р у . 
ррове that some response 7 is a known function 


of k variabl n =f (Si E 66 0.02. xala) = f(§; 0) (1) 
y tige es whose levels are denoted by the elements E oy бр cs SHOE the vector Е and of 
зма ers 01, ..., Ops ..., 0, elements of the vector 0, and that this function is not necessarily 
further suppose that it is possible to 


ut d а Мв variables or the parameters. We 

a ieee сын of experiments or trials in each one of y 
Th ar chosen set of levels of the variables decided in advance. 

€ problem here considered is that of selecting а programme of trials such that they may 


Ө ех В H H 
асс Xpected to provide results from which the p parameters can be estimated with high 
amme may be defined by an N xk matrix 


а, 

Ж 4° - In general the experimental progr à j : 

(6) called the design matrix. The uth row Е, of this matrix with elements 

ina Bins eos ču provides the levels of the I; variables at which theresponseis to be observed 
e uth trial 


f which the response is observed at 


1:1. Example 


ation we have in mind. Suppose а consecutive 


stance A decomposes to form substance B 


C. With the assumptions that the reactions 


at, after time Е, has elapsed, the yield 7 of 


£ ad flowing example illustrates the situ 
Which fien. A "m is under study in which a sub 
are first Mie ENTE decomposes to form substance 
Mtermedi er and irreversible it may be shown th 
late product B is given by 
2 

775 E {exp (70,6) - €XP (70, &) (2) 

Ж of the first and second decomposi- 


is 1 8nd 0, are constants measuring 
> 'espectively. 
e > 
t ome is to choose a set of times En 
the i 
Or e se observations 0, and 0; can 
"i Я орн the 
sim Perimental error in the observations. In this em oh tim 
Whig Y consist of a single column whose № elements 22 “ғо elements 8, 
опе, the yield was to be observed and 8 would contain : je = temperature, ёз 
Маны 7 there will be & variables (for example, Я " vie т 
» etc.) and the design matrix will contam I; columns. 


i ar 
1.2. Practical limits to levels of ? 
as time; tempera 


chere 9 


p the specific rates 


at which to observe the yield so 
tely as possible, allowing 

design matrix D would 
es Eq =o ёш wees м аё 


and 0,. More 


= concen- 


7 
u= 1,2, mo 

р 

be estimated as accura 


{а 
ture, concentration and dosage 


In 

тас; Е ; 
Can р actice, levels at which variables such dy of the chemical reaction above, 
the © set are usually restricted. Bor example. in the stu 7 some value č (max) which will 
be у aSibler: x Ade z will be from zero to ) 1 th 

8 th € range for the variable time (1) % iment. In general, there 


ngest time which could be contemplated D» 
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will exist some experimental region R in the 6 space which delimits the area within which 
trials can actually be conducted. In a chemical reaction, for example, such limitation may 
occur from explosive hazards associated with certain combinations of temperature x: 
pressure. In other cases it may be simply that it is experimentally inconvenient to € 
trials at certain combinations of conditions. In some cases the experimental region can e 
defined by a series of inequalities in the £’s of the type 


£(min) < é; < £(max) (i= 1,2,...,K). 


In general, however, the definition of Ё may be more complicated. 


1:3. Criterion for selection of design 


45 ге 
We denote the response observed at the wth set of experimental conditions by Yu and W 
suppose that 


9 

Ву.) = he = /(6,,9) @ 

d о? u=v Р " (4) 
an E(y,—7,) (y, — h) = 0 uv (u,v = 1,% ғ № 


е ч b 
where, in general, o? is unknown. The true values of the parameters are denoted s = 
буо, 020 ..., Opo, the elements of the vector 0,. The partial derivatives of the response functi 


5 he 
with respect to the rth parameter 0, for the uth set of experimental conditions, taken att 
point Өү, is denoted by 


ra= 5% o 
dE I тт 
Finally, the N x p matrix of these derivatives is ) 
(6 
F- УАР 


: A of 
It is well known that the least squares estimates 6 obtained by minimizing the sum 
squares 


N 1) 
2, Maf Ew 6)? | 
and given by solving the normal equations 
b . 8) 
2 716,9) Cf Eu 6)/20 4-5 — 0 (r = 1,9, esp) | 


nce cov y 
have a Variance covariance matrix which 18 approximated b ( F 


We shall proceed by attempti de 2 
y attempting to choose D so that the determ; "Е)- | is 086 „, 
small as possible. If we assume that each ЖЕ. мене а |E ЕЁ) | a a 


к Er of Ө, we are thus minimizing Wilk's generali : ai 
of this choice, also made b 
қ , y Wald (1943), can be see i i ima 
approximately normally distributed, the de ved а 
contained within any specific ellipsoidal pr 
parameters. The criterion chosen еп: 
Br sures that any such probability contour inclu 
To understand the significance is cri б) 
of thi: ion i i it Бө 
desirable to consider the relationshi Че арылы do а шыш ubi ы ы, 
sample space, and parameter s 


point P(0) in sample Space with co-ordin: Space can be mapped 


ates 7, = 
sample space covered by this рва "id ~ = эже 1, ...,№). We define the P 


to 
op at {1 
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T, H 
"s Min сс же point P9) on the solution locus, we can make an orthogonal transforma- 
тее ялын о ; co-ordinates in sample space such that the solution locus in the immediate 
пра eT. 1 (9) coincides with part of the p-dimensional hyperplane 6; = constant 
transform, ^. N). We сап then interpret |(F'F)-!| as the square of the Jacobian of the 
о ation from co-ordinates (б, sua oy ШЇ sample space to (0,,...,0,) in parameter 
» When P(8) = P(0,). So the criterion that |(F'F)-!| is to be made as small as possible 


Means о Б 
Seometrically that the p-dimensional volume of that part of parameter space asso- 


Clated wi 5 
ed with a region of prescribed small volume round Р(Ө,) on the solution locus shall be 


as small ; : 

a en ав possible. Since we do not know б in advance, we must assume that we can find 
i 1 | 
gn that makes this Jacobian about as small as possible for all points P(0) near 

0). 
Let 
Us j d . 

à p-di $ now assume that the part of the solution locus ne 

mensional linear subspace of sample space. Tf we also assume that the experimental 


e ; 

conia ірісі болы are normally distributed as well as satisfying (3) and (4); then ie 

With poi се region for Ө based on the likelihood ratio criterion consists of all Ponce és e 

sr on the solution locus within a p-dimensional sphere centred on = 2 eurn 

gives chen P(8). and with radius independent of the experimental design. 2 а Ks E 
an adequate approximation to the smallest possible value of the Jacobian for all points 


Toun | й 
а P(8,) therefore approximately minimizes the p-dimensional volume of the confidence 


gi i 1 се 
Te lon 
П parameter space. 


ar P(0,) can be approximated by 


m " 1 eliminary estimates 
W 1:4. Necessity for pr mw 
ele E E that the efficiency of different possible designs depends mm e i А in 
€. 7 are the values of the derivatives of the respons? function wi p 


taken by 
Хо elo em : dent of the values actually 
the ра w the derivatives 2/00, can. only be pem 2” For non-linear response functions 
г, 


the ‘ameter 0, if the response function f is line ЕЕ 
: Т от, depend upon 

the bue of the deriy e the efficiency of any particular Шы. m ipe р 
ET atives, and hence t nat effective design is possible at 


оца] ? se tl 
> Беа rdiet id vit Н ; е ane about the values of the parameters 
hin 


wo ощ 
» ауа also assume therefore that somet 
b О H . 
h information is 
avail Practica] Problems it will almost invariably beithe ба рте the case here 
and à ©, and this will then provide the basis of à first design. r on the reliability of the 
Afon i Where, the real effectiveness of the design ir nee а that preliminary 
Valu Чад On which it is based. In this pape therefor’, that the design would be 


ý i ceed so р 
Pti of the parameters are available and му 2 further discussion of the general 


Ptimay ; 
"tut if these quantities were the true values. ип ы uei. 
for which this theory is appropriate is given by 
7 оет 
Ie ig 1.5. Possible extensions of the ‘ ris zo 
: m к the variation 
Dos andi ion i because the у: 
CM Not be at im some examples А в response function if ix hopes 
Won th мы at all points on wards. TWO obvious exte ae ы кейі dm 
5 ; i ы 
psig oul =. either 2 ards or и Jikelihood ian ч ede ei sae 
о generalize to а тах ied | 
to *n ii or to eels weighted least squares. bentes ae "i na 2 
i sent theory Was applied. 


) and Cherno 


Mak, d out by Elfving (1952 
Ке y Elfving ( "t before 


Tansformation of the mo 


the pres 
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sible to 
istributi i i ld be possible 
i горгі tribution assumptions it shou 
stions of appropriate dis acd cud 
her з ram at least two other directions. In the first place in different pro ut Qus 
extend this "e reliminary values 6* will be different, and we might take accour мн 
j à 51: . . C A ä & ^0 
C ea a ue estimate of the prior probability distribution. ian some он 
ecd 1 to find the design giving 
3 d then attempt to fi g МІ 
s ight be cheaper than others. One cou | а " Ls 
i ^ төлке for a given cost. Finally, it would be desirable in many cir wa den 
ы т —Ó "ou à 
ae sequential procedure in which estimates from a preliminary experiment w өле aoe 
fi lan a second experiment, the results of which would in turn be used xa z : E e 
ied pointe and so on. 'These possibilities will not be further discussed here bu 
may be possible to pursue them in later work. 


2. SOLUTION 
Suppose that we are given the functional form (0) 
Nu = /(®„. 9) 


agt] 18 
4 such that ЕЕ 
and a set of preliminary values 0*. Then we shall choose the design D such that | 
а maximum within the experimental region R in the Е-врасе, where 


10) 
F* = (f) a 
af(6,, 0) 


" ve will in this 
To limit to manageable proportions the possibilities we need to consider, k^ of the 
paper assume that the number of trials (that is the number of combinations о d jals cot 
variables) is equal to the number p of parameters to be estimated. Each of the p tr oted tha 
of course be replicated to attain any desired degree of precision, but it should be n „yation™” 
an 7-fold replication of a design that minimizes the generalized variance using p m ving 
does not necessarily minimize the generalized variance for all possible experiments и to tb? 
rp observations. Experiments in which the number of parameters estimated is equa 
number of trials are not perhaps intuitively appealing. This is 
completely assured that the model is correct; so that one ord 
objective: to check the model and to estimate the parameters 


sentationally adequate, and (ii) because designs in which there 
number of points may provide 


po 
(i) because one is im p old 
inarily has in fact à " ерте" 
in it if it proves to be ЖЕТ) 
are more than the mini 


ion of B would be g? 
їй = 1-0 р 
: | «do 40, Ey 
However, in the light of possible decomposition of В to C it would be appropriate ' n e 
the experiment so as to be effective in estimating both 0 
When nothing 


10 
1 ànd 0, in the model of equ?" pt? 
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the теа и; 
reliability of i 
ý у of informati lE 
we hay : 1ation on which it is based. If this i ion i 
eno basis : ased. shis inforn -exi 
et 10 basis on which to plan. In practi d s mation is non-existent then 
ormation is availabl pn. n practice, however, the situation may arise where some 
model, but is ian able which is difficult to express in terms of extra parameters in the 
this case fut ба. reni in feelings about the way the response surface may be distorted. In 
d ints might be : imus iens in Hie 
doubttu Балкан of nord g : ee to the design in an intuitive way, so as to tie down 
us surface, w at i "ing in mi Е 
etermined ШК айанты a 2 Ше at the same time bearing in mind the locations already 
is adequate › maximum reliability for the principal parameters when the model 
When gro, 
€n gross 
S errors тау В . 
с ла; . ң 
hanges інілік у occur in guessing 0* the present theory can be used to explore the 
Over the region of d occur in the position of the optimal p design points with changes in 6* 
ings тү в, - 2 uncertainty and the losses in efficiency which occurred with given errors 
an p-points were available these might then be chosen so as to minimize the 


Oss of 
efficie 
ney as far + 
1су as far as possible. 
2.2, A geometrical formulation 


iat с ia қ Р : 
lass of designs for which the number of trials is equal to the number of para- 


eters Ез; 
18 à square (p x p) matrix. Consequently, 
and we a (ЕМЕНІ = [F*? (13) 
Sh , А , ; 
ould choose the design D so that the modulus of [Е*| is à maximum ш the experi- 


er А 
de region R, 
‚ under 3s | 
"ple саз standing the problem to be solved and in showit 
e 1 . 
і 8, the following geometrical formulation has 
es are the derivatives 


a p-q 
“Aimensi 

10 В n s. 

nal space in which the co-ordinat 


("5 2) = ¢,(@) (r= 1,5. sas 
беде 


ng the properties of solutions in 
been found of help. Consider 


(14) 


Where, it wi 20, " ы 
Senerate vill be noted, ¢,(E) is a function of Е only. The surface in the space of the derivatives 
ое Y all permissible values of £ we call the ‘design locus Thus with any рано 
in the £-space for a particular design 

f the deriva- 


о ‹ 
Mat i D ‘desig i j 2 р the space о 
т 10П points «эл Е we; E i 
: 5, 5» VIR esign locus Ш р 


. X 

ves ф в Will be corresponding points on the d 

Noy уа 
У th | 

© volume of a simplex in p-space formed by p-points 
; (2 жый 

(2, 212» „эы wees (Zso 289° 23 (рв 4 ^on —- ; 
is pr i determinant 0 he р хр matrix (£j. 
Miei И agit ca for which the corresponding 


boi, tently, w к манна ae 
; We must find р design points ge cum defined by equation ( 14), form with 


th, Sint 
he Origin Fs Space of the derivatives, with @0707 
he simplex of greatest volume. 
Ag 9.3. Example 
ch, a Sim | А in 11 
em; Ple illustration, consider the example m at 


Par alre Г 

ас i lim 

а tion. Suppose that we are given к of time so that the 
12 


ур} ` 


anq 
боп. 59 Origin 


оға consecutive first-order 
07, 0$ for the values of the 
best estimates will be 


apa, ete 
aila; i їз. We wish to choose two values Ёр 
or 0 0 W. 
p 05. We have И 
-0,&)r- (15) 
0. ЖЫ! RA) 
Biom. 46 


7 = (1; 0,05) = 9, — 0 
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2-5. Numerical solution ) 
Although in particular exemples there is an analytic solution to € ДЕНЕ sa 
mizing the modulus of the determinant |F*|, often numerical methods үші m а i 
Such methods are attractive because they allow us to obtain solutions ev en e пей. edu 
tion f(€, 9) is not given explicitly. In chemical examples, for instance, a kinetic € di 
can often be postulated in terms of a series of differential equations which may : t 
explicit solution. The equations may be integrated n umerically, however, оп а compu ped 
the quantities ¢,(§) for any trial value E obtained from difference formulae. P" d 
methods may then be employed to find the maximum value of the modulus o 
determinant. | d of the 
In the particular example studied it is seen from Fig. 3 that in the neighbourhood o sind 
optimum solution the surface is well conditioned, containing no oblique ridges, айа! = 
consequently a numerical method in which the variables were adjusted one at a time W а ie 
be successful in determining the maximum. It is fairly easy to see that the satisfactory > à 
of conditioning indicated by the ‘symmetrical’ maxima found in this problem qu ia 
expected to be general. Often the A surface would contain attenuated ridges which in € 
later stages of iteration would give rise to difficulties in numerical solution when the ное 
of adjusting ‘one variable at a time’ or ‘steepest ascent’ procedures were used. А dt 
which can be employed when the final solution is approached is the second-order p 
similar to that employed by Koshal (1933) in solving maximum likelihood equations. 


3 ated 9 
consists of fitting a second-degree polynomial to values of the determinant calculate 
suitably arranged series of points. 

Caleulations for improved values of £j, and 


= 1:10, 
біз — 6:50 (obtained from Fig. 


£12, starting from the values £j; = 
2) are shown for the present example in Table 1. 


Table 1. Calculation of improved values of maximum 


Eu [2n | -А 

1:10 | 6-50 0-803 336 
1-20 6-50 0-807 992 
1:30 | 6-50 0-806 709 
1:10 | 7:00 0-804 758 
1:10 | 7-50 0-798 920 
1-20 | 7:00 0:809 632 


"T" 24118 
Writing ж, = 5(£,,— 1:10), ж» = (£1, — 6:50), the second-degree interpolation polynomial 
—А = 0-803336 + 0-01525 lx+ 0010104x, — 0-01 18782? — 00142502 + 0-0008722472 
which has a maximum at 21 = 0-66, а, = 0-37, 


that is Ёп = 1-23, £= 687, 


| 
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methods to 
iu i Ж ж. 
җеза А m in which there are a large number oi parameters It is hoped to di: 
s dies : i © dise 
Nevertheless ar ater paper. The examples whieh follow contain only а few parameters ч s 
ss are thought to be of some inherent interest. ў s 


" 3. EXAMPLES OF NON-LINEAR DESIGNS 
onsid y 
er the model 3:1. Simple decay and growth law 
With 4 у = 0, exp (0.,), 0 < (шіп) < Еп < ёз < £ (max). (21) 
Tepresenting ti : 
law, With 0, > PP time and with 0, > 0, 0, < 0. this is the well-known first-order decay 
A г 
ower and upper lin E 0. the function can represent growth phenomena. In application both 
and latest hm mitis £ (min) and & (max) шау be imposed corresponding to the earliest 
sin time at which observations can be made. The value of the determinant is 
The modul A = 0, exp (06-84) Ca ёп) (22) 
ulus zhi 
of which for 0f < 0 attains its maximum value when 
= £,(min), 
11 > 1( ) | (23) 


12 = & (min) — 1/0. 
at the best design will be thatin which 
sent at the first observation 


Ue сү 


Sub . 
stitutir 
ng thes 
£ these values into equation (21) we see th 


r Е 
Proportion e-1 — ini 
= 36:89, of the amount of substance pre 


Tema; 
ains at + 
+ the second observation 


Fo 
г 0% 
2 > 0t} 
1e modulus of A is a maximum when 
Ж е, Ж * ТРЕ” ж 
Su = & (max) € (24) 
£a £j (max). 
expected at 


T 
he first obse 
= hen 36:8 % of the growth 


е fin 


ation should thus be made at a time W 


al ob, 
Ser . 
f Insteaq a has occurred. 
" of the amount of substance decayed we consider the amount of pro 


a simp] 
Ple decay law, we have 


duct formed 


for y = б{1—ехР(® £,)} 
© valų 0 < (шіп) € Ey < ё < & (тах). (2b) 
© of the determinant is 
the ЭР, Жылын „аен 09 
ulu . т. 
1s of which has a maximum when 
(27) 
әт Е (max), 
ШИТІ) es) 
Чы én = - i exp (tbe) 
ар, uti ‘on by which the ield 
t n : " ; the fraction DY у! 
Va] Ime 3 d these values into equation (25) and letting Уг 2 se find that the o ptimum 
Че op 722 = (max) falls short of the maximum ОРОЗ = m by a fraction 
11 Corresponds to a yield falling short of the maximu 29 
Ja -729 М9 


y= api- Пе 


6 Design of experiments in non-linear situations 
8 


icular if the second observation is taken after a sufficiently long time so purs 
с. aia ste is almost achieved, then y, is close to zero and y, = 1/е, so that the le, i 
кзы should be made when the process is 63-2 %, complete. Аз a second ee 
v cuentan observation is taken at a time when the process is 50 90 ашак nid 
and the first observation should be made at a time when the process is 27 % comp 


3:2. Mitscherlich equation 

Consider the model (90) 
7-0,%0,ехр(0,4,) 
for 0 < &(min) < & < Е, < Šis < (шах). of 
А 5 м 

With 0; < 0 and 0, < 0 this expression represents the well-known Mitcherlich 18 
diminishing returns. For example, in crop experiments with & 
fertilizer and 7 the amount of growth, 0, -- 0, represents the grow 


added and 6, the hypothetical growth obtained from an infinite 
measures the rate at which the res 


the amount of adde 
th when no fertilizer 
amount of fertiliza i. d 

ponse to each additional inerement of fertilizer sem d 
The same model has been employed to relate the weight 7 of chickens to their cumu 


e 
feed consumption £, (Hendricks et al. 1932). With 0, > 0 this model has been u$ 
represent early growth (Brant, 1951) 


The value of the determinant is 


А- 04, exp (93 £11) {exp (95 £13) — exp (93 £12)} + баехр (05 E19) 


| (31) 
x {exp (93 £11) — exp (05 £15)} + £j, exp (05 £13) {exp (0, £12) — exp (9s 5.) е 
" + : me jn! 
It is readily shown that irrespective of the signs of 94, 0z, 03, the optimum design 18 obta 
with 


(32) 
ёп = E (min), 


3) 
Еа- = E. + ба exp (0% Šu)— Sia exp (0% баз) " 
— к аке XS E WS) 
3 exp (0$ Eu) —exp (0% бз) ^C 

ёз = B (max). 
Tn particular in a fertilizer е 
tilizer and £,, = 2 (шах) 


maximum by a certain then £,, wou 


fraction y, where again y, would be given by (29). е jat? 
st maximal response then the intorn d 
0 yield a response of 63-2 % of the m? 


(39) 
7 = exp (— 0, £, е-%6) 


for 0 € &(min) < 


ба < б» < £j (max), 
&(min) < & < E 

This would apply for example where 
t = Ё, ofa material decaying accordin 


E (тах). ү 
7 represented the 
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Sensitive, А. 
. Assumi { 
зна сабк mes. е the temperature effect follows the Arrhenius rule, 0; is th ifi 
e = 7-1. 7-1; emperature 7}, 0, is proportional to the ‘acti "s өлшей 
rs ^ j tivati қ 
oft ois the deviation of t i di- E 
h 4 bas н the reciprocal of the temperature, 7) from the reciprocal 
Ve fin a ГИ 0; 
d for preliminary values 0f, 0f that 
blu = Бекр Ut теа) = ЕТ 
2 OF % 


Palë E) = ЕЕ, 6) = Ev Ing* 
ibus —— "e 2 f£ 9, (Ey Se) = — 5:7 In g*, (36) 
= ех = с —0%, 
I Of č u e-?15:), the expected fraction of decaying substance remaining in 


the 
uth experiment, Thus 


Us 

Жаа —£ тўш 

ж 1 62171 

modA = mod | ® VACUUM ЕСІН 
yt OF (22—21). (37) 
inn бы ШИ 


Also, if the range of permissible values for 


Now — 
37) is maximized by ensuring that 


710 " 
the times s 7 hasits maximum value when  — 1/e. 
ў s 
mE deat is sufficiently wide then the expression ( 
achieve the maximum possible range £,(max)—£,(min) and adjusting the 


imes of 
reactj А 
ction so that the anticipated fractions remaining, 7), 73, are each equal to 1/e. 
case are obtained by substituting 7 = e! in 


© actual 
values of £,, and £,, required in this 
Забей with (min) and (тах) are then 


9quati 
on (3 
en b (35). The appropriate times £j, £12 85806 


giv 
1 à 
Ба = gs exp (OF Ein) 

1 
bya = ge exp {OF Eman) (38) 

1 
Suppose the absolute temperature is 
enient to take as an arbitrary base 
asured in minutes а guessed value 
ture is ot = 1:0 and а guessed value for 


It ig 
9s igi dre to consider a numerical example. 
temperature thee range 380-420°A. It is then con” 
бу the rate р Т, = 400? A. Suppose with time Шо 
* = 16,000 i i at this arbitrary base tempere 
; then we have 
0f = 10, 6 = 16,000 
380^A. < T = (&+0'0025)" < a 


Th 
my. ОРЫ: қ 7 
щі, mum design will consist of two eXper ents, 
91 min. and the 


te Uum t 
m emperat č i = 

era; ure of 380° A for time ёп = 
od are > of 420° A for 0-15 min. These times aro found кан rrec 
"a "bad those values аб which, if the guessed Шы - a ы The 
Р ошер ntal error, the fraction of product remaining WO d be 1/e- Аз 
y nde, cally in Fig. 4. In this figure the p the sl E es 
құ , н the values ф, (5, £2) = 0 and 058) = рн To fines in 
оо Sta Spectively, that this derivative ст take. The а fin 

nst, ti (i.e. for constant temperature) and the curve eil 

Ime). Where, as we have assumed above: the ре 


the first of which is performed аб the 
second at the maximum 
tituting in equation (38) 
t and there were no 
design is illustrated 
(1, £3); [GE £&) is 
t and the smallest 
Fig. 4 are for 
for constant & (ie. for 
e range of & dictates the 


design locus 
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design points selected, the design locus is restricted to that portion of the plane within the 
triangle bounded by the three straight lines 


ф1(61, 5) = — 10е, 
ф(51, 55) = ОФ, Ё) ča(min), 
Palën 2) = –0#ф1(,,,) (тах). 


The triangle so delineated is itself the triangle of largest area which can be formed with two 
points on the design locus and the origin with only temperature restricted. 

The appropriate design for the case considered is indicated by the two heavy dots in the 
diagram. It will be seen from the figure that, if £, were the restricting variable, then a some 
what different result would be obtained, since now the triangle of maximum area would be 
achieved when the value of $, (&,, 5.) fell somewhat short of — 1/0ře. 


Ға (min.)= 4, 


390 
3 2 
T 
e ~ 
E NES 
© - 
+, 400- um 
+, 400 о = 
т © 
I x 
£ 71% 
Е 
410 -2 
x 
420 4 


-03 -02 -041 0 
— $1 (61. 83) А 
18: 4. The design locus for 7 = ex Ж B t 
= exp(— I-0£g-160007—-09025)) тр : i in) = 38 
d т > - The optimal sol T(min) = 
(max) = 420° A. and 0 < Е < 00 is indicated by eg tee. 

4, CALCULATION OF THE ADJUSTMENTS y 
Suppose a design of this type has been obtain үр 


Y. Then proceeding ; 1 way 9 ^ ct 
to be applied t кы ng in the usua А fir? 
approximation by minds preliminary values 0* are given to ? 

бедеулік, ар) 
or in the special case where the nu y 
mber of : я a 
meters p by er of observations N is equal to the number of P 
6- Fey ұз), 


G. E. P. Box anp H. L. Lucas 
89 


In tl «ample of 2 3 
пе і 
particular exa P 3 


ала 
e 
ge КЕ 
sgj 


50 that t ; 
he adjusted values will be 
0, = 0+0), 
0, = OF +8» 


lements in y —* on the columns of 


Where 

ó. 
1 and 0, ar à 2 
ан -— we the ‘regression coefficients’ of the е 

Osee th е 
е ; 

Such that E C of this suppose that the v 
бы: — Šoa, then the adjustments in 
= -@—е ей, 
б, = (1— Yo) e[2En- 
he observed values of у were equal but different 
nt was needed only in the basic rate 
ould imply the necessity for 


alue of 0 were taken at à base temperature 


0$ and Oy are simply 


As 
Would b 
ш the e аач: therefore, when t 
gp ни 0% ^» c1, this would imply that adjustmer 
Tustment in — values for y; and ys 91 the other hand w 


5. Discussion 
at particular sets of 


vations made 
ted and consequently 


highly correla 
t of poor design in which case the 


n a more informative set of 
Not infrequently, however 

rty of the model itself, 
This point is well 


ated from obser 
the estimates are 
e simply the resul 
arting values to desig 
tes can be obtained. 

stimates 18 а prope’ 
related estimates. 


When 
Sond ое are jointly estim 
Ve very mic id happens that 
5 group ein variances. This may b 
\ 8 from t eapi may be used as st 
an urns out уну greatly improved estima 
ill hat iino "ex unsatisfactory nature of thee 
Strated bem i best design still yields highly сот 

à odels 

els such as 146,6 
па 
2 351 

у outlined, the optimal design will be found by 
hich together with the origin form thetetrahedron 
ch a model often turns out to be a nearly 
riable, г, the volume of the tetra- 
р. This arises from the nature of the 
examples, when one has found one 
by making 


f data, one can, 
ite different values of the 


fria] 


In "- 
rd: 
of Sing а with the method alread 
"ai Catest ә. on the design locus W 
e ht line ee But the design locus for su 
mo, Ton ang іе о that for any reasonable range of * 
вер + itself, c: the value of the de s sma 
Sita, Values of 2 values of the Z's are 
% ble com Һе 0’з that most closely ag” 
pensating changes іп the parameters, 


terminant i 
chosen in 


em we are considering 


t 
at а, 
p this 1. almost as well. 
: s т nnexion it is important to distinguisl between the gu ters 0 
the n what s hat of finding designs most € cient 107 gem ee eee 
ifferent hich is not discuss gre, Ра" : lesie 
problem w hich is 70 in a given region. А particular design 
lated with large 


d most recisely 1 * 
н ters are highly corre 


Tha, рон 
еу = ; 
vb 7 = f(8; 0) is determine! 
f the рата 


e su 
с 
h that the estimates, 9, 9 
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T à . £hisds 
variances and yet the response, 7, is estimated reasonably accurately. The reason for this 5 
not difficult to see. Instability in the 0's which is compensating is by its very nature not suc 
as will cause instability in the estimate of y. 


The authors wish to acknowledge their indebtedness to a referee and to E. M. L. Beale for 
valuable suggestions for improving the presentation. 
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THE ES 
5 ESTIMATION OF MISSING AND MIXED-UP OBSERVATIONS 
IN SEVERAL EXPERIMENTAL DESIGNS 


Bx J. D. BIGGERS 
Department of Physiology, Royal Veterinary College, London 


І. INTRODUCTION 
esigns depends upon their fully balanced 


Тһе і 
simp xi = 
ple analysis of most experimental d 
w of the units in the course of an experiment, 


атгатое 
m . 
Bn = ent. Often observations are lost fora fe 
over Ө "e spi à 
ercome the consequent difficulties n the statistical analysis, because of non- 


ОТ осо. H 

1990, Yan the technique of inserting ‘ missing values’ was introduced (Allan & Wishart, 
In "i es, 1933). The values which are inserted minimize the error sum of squares. 

а мба paper the problem of calculating these missing values will be investigated from 

Q) a point of view for the following designs: 

(2) randomized block 

P т 1 | H H H 

(3 andomized block design with replication within units, 

le cross-over g 

lati 4 

5 atin Square and higher squares, 

9) Split-plot, 

Just; : 
Mating mir of the variances of treatment conpariso 
cedure nor values, but although this can be done 

Will not be discussed in this paper. 


ns is sometimes necessary after esti- 
by an extension of the present pro- 


К NOTATION 
ir Um in italics. 
Bice nce case letters in Clarendon type 
ransposis capital letters in Clarendon Жүре» — 
ummat sition of matrices and vectors is indicate y a If we 
(ii iiti qan summation symbols will be E . ИВ 
› 5... п), we define ordinary summation over 8 values ав Zt; 


Жо = 


consider the sequence 
Also we define 


ically one relating to the 


where the number 


noted by Sign... 
in a ran- 


nis de c 
Г d. Thus ап observation 


nina desi 
n observation in a Graeco- 


8800; | 
rs i. observations—an observatio 
Өліге = indicates the order of class 
ші Squa: ock requires two subscripts for 

Те requires four. " " Thi 

us i cripts in common. 8 

à m арыйт бако диет they may а cor oar ratio observations 
Fn eta te днее е relation C we consider two observations 


о 
Te] 
ated М 
in 4 р 
a Specific way will be called associates- be 
ciate can 9 


Tor and 
) 2 Хм. Тһе following classes of asso : 
B ppg AAE 


‘on, and а 


-order associates: 
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(2) lst-order associates: (і) i associates f-2ugcvhz ы 
(i) j associates f + и, g =v, h + w; 
(ii) k associates f + u, g + 0, h = ш; 

(3) 2nd-order associates: (1) 2 associates f=ug=vh+ w; 

(ii) ik associates f = u, g + v, h = w; 

(iii) jk associates f + u, g = v, h = | 

Particular associates may or may not exist depending on the MM d ір 

А potential observation in a design will be indicated by a capital letter: X ijo 1 
fact a missing observation it will be indicated in lower case: а... 


General solution A: 
Let there be р independent missing values denoted by z;(i = 1,2,..., p). The Garton vel 
Squares is a quadratic function of these values, (а, еу 2р). The vector of partial с 
tives of this function is given by (1) 
2(x'A — q')]N, 
Where x is a column vector of the р missing values, 
Aisapx D symmetric matrix v 
design and the distribution 
q is a column vector Whose elem 
Nisa constant determined by the experimental design. 
The values of the z; where F(x) is a minimum are 
equations obtained by equating each of its parti 
notation: 


өлігі 
Я -nerimel 
"ове elements are determined by the exper 
of the missing values, 


1018; 
; servatio! 
ents are calculated from the available observ! 


18 
х а е aneo! 
given by the solution of p pappa” 
al derivatives to zero. These are, in 

(2) 
Ах = q. 
Several methods may be used to solve 


‘on and 
equation (2). Most textbooks on the desig 
analysis of experiments give the formula 


cle 


d block design (Baten, 1939, Те 
Federer, 1955). The split-plot eed 948) 
Anderson (1946) and Khargonkar ( pat 
е than one missing observati 


+ ne 
3 | explicit determination of the a j 
equations. In this Paper rules will be given which enable (2) to be written down rap! ple? 
the different experimenta] designs. The main advantage of this procedure is that it pa 
the nature of the equations to be examined and this leads to gi lifications Ш sev 
important complex cases, ешге que 
The matrix method has been the sub ing 
Ject of a paper hy 56 issing “e? 
in the randomized block design, This idee, ЕЗ ү роз чи = is кейін 8 
distribution of missing values, presumably because wt has Only dealt with a 10% » 
of A. Very Tecently Wilkins : 


йе” + 
alues of А-а are written dow? pi 
has Shown how t 


com \е matrix A eam be used t0 С i? 
aii Зо, . PATISONS, and 
estimation of aL ватт, Ale. 
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— - II. RANDOMIZED BLOCK DESIGN 
random, We p» : — each containing / plots to which ¢ treatments are allotted at 
Pe ^no servati ji l е 
X, ers m P observation made on the jth treatment in the ith block by 

‚7; = 12,14). At the end of the experiment let p of these values be 


+ 
, 


Missing, | 
5. Itshould b sd that oar à dod 
ү Subscripts e noted that the missing observations are indicated by only some values 
et the itl gs 
1 block and r . 
block and the jth treatment contain с; and d; missing observations respectively 


len 
p= XE; Xj; 


Missi 
88 
Sing observations in tl А А 
s in the same block are i or block associates and missing observations 
alled j or treatment associates. The missing observation у, 
Qon 


Teceivi 

ng the в; 

5 the same treatment are с а 

sociatesand (p — (c, 1 
gt ay ) 


Villha 
н ave therefor ; 
Zero cae (c, — 1) block associates, (d, — 1) treatment as 
Let B | 
m. total of the available observations in the ith block, 
G ier of the available observations receiving the jth treatment, 
otal of all available observations. 


Th 
e erp 
or term i à a" 
rm in the analysis of variance is given by 


І 


1 


E. 

Уна у б керн n D өр 
бы ha vij Yip (By + Sept) ЛТ И (3) 

s п iati e Е 

*quatin піні of (3) with respect to à particular missing observation, ау, and 
€ the derivative to zero, gives the equation 
(Уук + Хо 
i and j associates gives 


"ж, TEE, =r. tr,- G. 

Separatio My — TNX oj "В,-4,- 
n of ter. р 

of terms corresponding to zero, 


zero 
( i associates y associates associates 
r= 1) do я ( SS “атала ууулу = 1B, +t, — G. 4 
Ex ( 1) а (1-7) Б t) Хә Xo у 2 h (4) 
the coefficients of the ау) the B, and 1, and hence 
f A are: 


determine the elements o. 


Dres, q 
| Sion * 4 
the : (4) enables us to write down 
sideration, 


Matrix 
p „А and the vector q in (2). The rules which 


(i) 
T 
(ii) (1 11 1) for the missing observation under con 
(iti) aL E for the i or block associates, 
(i ) ) for the ог treatment associates, 
ular, and thus the equation (2) will 


lf 
о 
h Pr: T the zero associates. 
ау асса] sit е | 
uni uations the matrix 
mj, mpl que solution. 
СЯ % 
ex ing val - Table 1 shows the results of a random 
(Ре he; oceurred in this experiment, but fo 
4 IT NES 
: 9 missing values are calculated a$ follows 
orresp' 


е ri Е 
“senti ite out the table of results so that the rows € 


A will be non-sing 


ized bl 
ur have been 


ock experiment (r = 8; t= 5). No 
omitted to provide the 


ond to separate blocks. This is 
28 for the missing 


associates. 
ociates and 1 for 


alt 
3 0 avo} por: th 
" Obtai. oid confusion in determimng the E n 
ы. ati n the coefficients of the associates. In th ree 
"id Оп under consideration, — 7 for thei associates, 

" (3 ne 5800їаќев, 

& Sert ; 

termi, ert in the table of results the letter fi net ч 
Ned by the position of the missing value in the table, е.5. #35 


alue with subscripts 


жи for the missing V ita 
© the missing value in 


\ 


94 Estimation of missing and mixed-up observations 


М imated 
the third block receiving the fifth treatment. Space should be left to insert the estima 
missing value. 


Table 1. The wet weight in mg (x) of embryonic chick 


aye TT ically 
tibiae after cultivation on a chemi 
defined medium containing five different concentrat 


are 
ions of glucose. Treatment numbers 
placed in parentheses. The variate analysed is log 10x 


ен | 


Glucose concentration (mg./ml.) саш 

plete 
Block я В; block 
| totals 
4-0 (4) 8-0 (5) 


0-5 (1) 1-0 (2) 2-0 (3) 


I 0-88 115 1-33 6-82 
II 1-06 1-22 163 | 154 1-37 — 7-00 
ш 0-97 L34 | (ти = 150| 1:66 (£as = 1-53) | — 3-07 (By) 6-44 
IV 1-09 1-21 116 1-50 1-48 — 7-05 
у 1-14 137 1-58 1-52 1-44 - 715 
VI 113 1-33 1:65 1-57 147 — 
L21 | (a, = 1-44) 1-45 1-52 
р 


Б) 5:18 (В;) аа 
ашт 112 1-30 L35 (а = 1-56)! 16 5-38 (Bs) 
т, ЕС 


1:39 1:57 


= 8-70 (Т) 10-63 (T) 10:46 (7,) 48:31 (G) E 
m S ақ mm | 
Completed 

treatment 8:39 


10:13 11. Я 
totals 64 12-19 


.34 
11-99 = 54 


Table 2. Calculation of ^ and q 


А 4 
Subscripts 
igus Lon | a ae 
33 35 73 84 B, T, G 

А 

33 28 М e 

35 -7 к Е: 1 9-97 8-70 4831 

73 -4 1 1 3-97 10-46 48:31 

84 1 1 28 1 5-18 8-70 821 

1 28 5:38 10-63 4 
=l 
r=8 $=) 
(4) Obtain the В, Т, апа ғ s or de 
» 4 rom the availabl ; қ he b 
of the table. © observations and write them at t 


аф 
һе subscripts in a row at the top and a colum 
pairs determine which ot? 
› and the appropriate coe e 
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+; 
able columns for the B;, T; and б are prepared. The i subseript in the left-hand column 


determi с i i 
2... B; total in a given row, while the j subscript determines the T; total. Beneath 
гапа T; columns the values of т and t are written down respectively, and beneath the G 


column the value — 1 is written. 
6 : i i 
(6) Invert the matrix A. If the exact variances of treatment comparisons are not 


requi = Е "s SNR. 
Шы А ! need not be obtained explicitly, and it is quicker to solve the equations (2) 
m with q determined as in step (7) below (see, for example, Fox (1950)). 
) Write down the solution as follows: 


Act q 
E 0-039 0-010 0-005 — 0-002 26:95 
35 | _ 0.010 0:038 0-000 — 0:002 85:75 
273 0-005 0-000 0.037 —0-001 36-63 
\ 284 0:002 -0002 - 0-001 0:036 47:88. 
h row of the right-hand side of Table 2 with 


Qisobtai 
the cce by row by row multiplication of eac 
om row. Thus, in the example, the uppe 
| TH 8x 3:975 х 8-70 — 4831 — 26-95. 
he 
Nieder ues of the z;; are then obtained by standard row by column multiplicati 
ple х3; = 1-50; xy; = 1:53; аз = 1-44; vg, = 156. 
1, and the block and treatment totals 


8 
oe values of the'a,; are inserted in Table 
pleted. The standard analysis of variance then follows. In this example 4 D.F. are sub- 


r entry of q is given by 


on. In the 


% 
ұза from the error term. 
th pecial cases. The process of calculating А-1 can be simplified if Aisa pxp naded 
e form: 

a b b 

b a b 
(5) 

b b 4 
Jements of a row i$ а factor of its deter- 


sum of the 6 


trix is singular. This property will be used later. 


ТЫБ, ns 
e cireulant matrix and hence the 
ant, Thus  {а--Ыр—1) = 0 thems 
| inverse of (5) is given by b 


(a4-b(p- 2) 
b 


ПЮРЕ — 
(a—b) (a --b(p — D} | 
n -v (a--(p- 2) 
| Thu = 
2 i down directly- 
own, А-1 can be written do i кеп 
» еч cases where this is useful in the randomized plock design 
limitat | e 
mining cbrerrations aro win s cm treatment associates. These matrices also 
пе: 


(2 d. 
754 M missing observations are еї 
n other experimental designs. 
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SATION Wr y UNITS 
III. RANDOMIZED BLOCK DESIGN WITH REPLICATION WITHIN U 


d at 
z taining £ plots to which £ treatments are cmi " 
EE E d fan jn bur nri vi 8 manda 4.1. -im lysis E Em en 
rdiet de in Та с present time there is considerable discuss wr 
Un сі жін а 35 ны of this design (Anderson & Bangroft, : 
Қарар 52. ene 1955). The models dutenmina the expectations of the mean mee 
Ше Table 3, and these in turn determine the appropriate error terms in 
of significance. There are three error terms to consider: 
(1) the within-plots sum of squares (E;) (Case 1), 


(2) the blocks x treatments sum of squares (Ен) (Case 2), 
(3) (Е,-- Err) (Case 3). 


t 
: а vations per pl 
Table 3. Analysis of variance for ar x t randomized block design with s observations ү 


Source of variation D.F. Өр ОНР 
Between blocks (r-1) ры E: ^ 
Between treatments | (4-1) E i^ T+C) 
Blocks x treatments interaction (Ej) | (r— 1) (t— 1) (P— < _Р) 
Within plots (Е,) rt(s— 1) pes 
Total (rst—1) аны 
Where S — total sum of 


Squares, C = correction fo 


T great: 
= plot sum of squares. 


r the mean, B = block sum of squares, 
ment sum of squares, P 


We denote the kth observation on the jth treatment in the ith block 


by 
Xu. (i = 1,2, 975) 21,2, 5k 1 


2... 8). al 
дра, | tot 
At the end of the experiment let Di; observations be missing in each plot. Then the 
number of missing observations is 


P = Х,У ур. 
The following associates only exist: zer 


о, $, j and ij. 
Let P; = total of the available observations in the ijth plot, 
В; = total of the available observations in the ith block, 
T; = total of the available observations receiving the jth treatment, 
G = total of all available observations. 
Тһе thr 


in 
mine the elements of A and Ч for each case are 809 
Table 4 4 10 
B i р. 
If we write down (2) for these cases, ordering the elements of A in groups correspond! 
the ij, we obtain in general: 
АН А... А, хп qi 
Ay AB "EET m 
А; н А, п . . АЯ 


т.т x" а" 
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along the main diagonal, one corresponding to each plot. 
matrix of order p;;. The superscript denotes 
the position of the submatrix, 


Where Аа = P) are submatrices 
Each isa square symmetric 
the plot, while the subscript denotes 

А.да + £) are rectangular submatrices, 
и is a vector of the ру; missing values in plot ij. 
q” is a vector of p;; elements caleulated from the available observations. 


Table 4. Coefficients of associates and typical elements of q for the three cases 
in the randomized block design with replication within units 


Coefficient 
| 
Associate = т n 
Case 1 Case 2 hs 
(8) (Ej) (E, 4 Ej) 
Em 
Missing value (8—1) a wi a pim 
2 | -1 (r— ЗЕ 
р | 0 (1—7) (1—7) 
4 0 (1-0 (1 - t) 
| Zero 0 1 
| | Typical мат 
| 
ы қ NNNM 
аын: Р, fo е = 
Case 2 7B, (T, - iP 7 8 
мін В+ 
s ding row and column in (6) 
ч i t i e correspone с 
Vani Plot has по missing values the submatrices of th eal пену а 


боер, Тһе elements of the submatrices along the ей” 
ents for the missing value and its ij associates 0 y. 


e no ; 
W consider the three cases discussed above. 


Ase 1. 
The matrix of (6) is of the form 


As o (7) 
о А 
Si Ант 
Nee tha; 
he Mverge of (7) is 
Аш à о 
о Е 
Аян Biom. 46 
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the estima v issing v in all other 
imates of the missing values in plot 1 is independent of the missing alues ina 

lot "Therefore the missing values for the ijth plot are given by the solution o 

plots. f 


(s—1) =I ae” —1 Vin T5 
=, = „ кй Тур By 
-1 =} s» ф-Ы Іш, By 

Pij X Pij 


" jons 

This has the solution P,;/(s — ру) for all the г, Thus in this case the — шан Ж 
are estimated by the average of the available observations in the plot. If а ре 
a plot are missing p;; = s and the equations have no straightforward solution. — 

When r = 1 we have the case of a fully randomized design. Thus an estimate o | 
value, if required, is the average of the results available on the particular чеин 3 A 

Case 2. The submatrices, in this case, have equal elements throughout. The ee ad 
have elements equal to (r — 1) (#— 1) while the А.(о + f) have elements equal to (1 t identionl 
or 1. The rank of matrix A in all practical cases is rt since it consists of rt sets о i "alus 
rows of elements, Thus to solve (6) we may allot within each plot (p;; — 1) arbitrary | пез for 
and then solve for the remainder. In practice, therefore, we would insert arbitrary ie (6) 25 
all but one missing value in each plot, say P,,/(s—p,;) for 8 + pij, and then write we plo 
though one value only is missing in each plot. Alternatively, if we work entirely v "rr 
totals we can use the formula for a randomized block design to estimate a аце. 
ignoring all the available observations on the plot. This situation arises when we m 
the whole-plot error term in a split- lot design. . һе 

Case 3. Here the matrix of (6) is теа з and cannot be simplified. In this cas? ы 


jn 
Е s s z x cx allow’ 
matrix remains non-singular even if all observations are missing from a plot, thus 
estimates of these values to be made. 


IV. CROSS-OVER DESIGN 
Consider an rxt design, where each of the ¢ columns is a Separate replicate ti mes 
r treatments, subject to the condition that each treatment occurs an equal number О 
(а) іп each row. Then t = ar. 
Denote each observation. by Xili, k = 1,2,.. 
missing. In this design the following associates only exist: Zero, $, 9, k, ik. 
Let В; = total of the available observations in the ith row, 
C; = total of available observations on the jth column, 


Т, = total of the available observations receiving the kth treatment, 
G = total of all available observations. 


The following rules determine the elements of A: 
(i) (r—1) (t—2) for the missing observation under consideration, 
(ii) 2(1—r) for the ik associates, 
(ii) (2—r) for the i and k associates, 
(iv) (2-4) for the j associates, 
(v) 2for the zero associates. 
A typical element of q is given by 


"Е,-0, 7T, 2G. 


of th? 


«ons 96 
әт-1,2,...,4). Let p observatio 
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When 
тей 3 " 7 А i 
Жыр the coefficient for ї and k associates 1s Zero. This leads to independence of the 
or estimati SENT : : s 
cima estimating the missing values in some instances, thus allowing the formula for 
D 15511 7E 5 imi 1 1 
ng value to be used separately for each plot. A similar situation arises in split- 


plot designs (see Example 2). 


V. LaTIN-SQUARE DESIGN 
an be obtained from those for the cross-over 


The f, 
or 

rmulae for a x t latin-square design С 
1: associates do not exist. Thus the rules 


desi ч 
nisu ing a — 1. Under this restriction 1 
б) g- pa elements of A for the latin-square design are: 
Gi) (20 ү -2) for the missing observation under consideration, 
(iii) 9 for "i the i, j and k associates, 
À typical ele e zero associates. 
ent of q is then (Ry +C, tT- 2G. 


Suppose VI. HIGHER ORDER SQUARE DESIGNS = 
enote tin have a Ех t square of order n, where n dagoten the number ps classifications. н 
Particula: RN made on each plot by Xaas. an ia zl д. 2; f E к ажа 
ва. Let іі 55 will be denoted by Ха. де where the J; repre 3 
n all fin 0 servations be missing. 
tC _ e designs only zero and 1st order as 
G а не; of the available observations on 
У an dina al of all available observations. 
n of the methods already descri 


of (2) . 
18 
(F- 1j: 1 m)ng as. a2 eae 
as, fs да P 


131-1 Diop Des) an natn 
ia d уыш ant fn 


sociates exist. 
the ath classification, 


bed it can be shown that 2 typical equation 


(an) Vj ag... an 
int fn 


mor Gay ag fn 
ри 


E eR » 
21-0 Хы 
aci DIR 

-1 ЭХ! Ulap Pa aa an 
+ (п e fs anc Bn я 
ыбы ОС ali 
7 i 5 for writing down the coefficients 0 
b atriz e may determine the rules shown 1n Tete e ac NAA шыйк 
cidem ве 3 determines the coefficients of 

earlier. 


for vari 
ous values of n. n = 
az. T of = 
ан treatments equals the number of blocks. ” ula given 
q f the form 5 mo. 
g then matrix А is 


«atin В 
fing PQUuare, thus giving an alternative proo ciates only are missin 
1st order 8580 


‘ofa the hi 
the fors Eher square designs (i — 1) j 
(5). The sum of the elements in a row В 


(-1)4-т! 


Prom = С, + 1С, 


ape 
an +(t- 1) (n қ D 
uic theres ) еве straightforward solution. This 
LN Te the matrix A is singular and equatio? ( E assification, 6.8. а complete TOW, 
um Jues are in 0 hods for dealing with missing 


ari 
Valy or iig whenever all the missing V? iption o met 
өз: eatment in a latin square. п descriP given by De Lury (1946). 
7-2 


atin- Re ris 
In-square designs where is singula 
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Table 5. Coefficients of associates for square designs of different order 


Хо. of constraints 


to 


| 


a 


| | | 


Associate Randomized | Latin square | Graeco-latin — H yper-graeco- 

block | Square , latin square 

dc | | 

= = | - жие 

| | 

Missing observation| (4-1)(4-1) | (t—1)(¢—2)  (—1)(—3) (4-1)(-4) | и-1)@—п+1) 
Ist order (1—1) (2—1) | (3—1) (4-1 (п=1= 0 
Zero 1 2 | 3 | 4 (n- D 


VII. SPLIT-PLOT DESIGNS 


(1) Whole-plot treatments arranged in randomized blocks 
Consider a design in which r blocks containing ¢ plots to which t whole-plot treatments ae 
allotted at random. Further let s split-plot treatments be allotted at random within eae ‘i 
whole plot. The analysis of variance of this design is given in Table 6. There are tw error 


terms to be considered: (1) E; corresponding to the whole-plots (Case 1), and (2) Ем dit. 
sponding to the split-plots (Case 2). 


Table 6. Analysis of variance for a split-plot design with whole-plot treatments 
arranged in randomized blocks 


Source of variation D.F. Sum of squares 
| - — 
Between blocks (--1) | (B-C) 
Between whole-plot treatments (A) (t— 1) (S—C) 
Error (Ej) (r—1) (t 1) (W-B-S+C) 
| — x ———Á 
Total (rt— 1) (W-C) 
Between split-plot treatments (B) | (8—1) 
қ 1 = т-с 
ДЕ interaction (s—1) (2—1) (Т E- - A C) 
rror (Е) | 8—1) (r— 1) (Z-W-I-S) 
Total | 
| (rst — 1) (2-С) 
Where Z = i = 
rection for the Етік нь yl 4 a е i utc air 
‚D= uares, § = T = 
plot treatment sum оҒ Squares, І — treatment тн pen 
res. 
We denote the observation made on the i 


kth split-plot treatment іп the ijth plot 


Жый EN ME : е d 
ijr ( (53 = ,2,.,6 Б = 1,2, ...,8): At the end of the experiment let 7 spl 


plots be missing. 


Тһе following associates exist: zero, i, j. k, ij, ik jk 
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em ear — that the exp ression for the error term is the same as that in Case 2 of 
the avails bled a! dis block design. The solution in the two cases is identical. We ignore 
тйрй гөй Ы Serv ations and estimate new whole-plot values using the formula for 
ock design. 
Case 2. Let W, = total of the available observations in the ijth plot, 
Ij, = total of the available observations receiving the jth whole-plot treat- 
ment and the kth split-plot treatment, 
№; = total of the available observations гесеї 
treatment. 
P3 Pin which determine the elements of A are: | 
(8) 1 )(s—1) for the missing observation under consideration, 
(ii) ü sa) for the jk associates, 
бу) 16 ”) for the ij associates, 
(v) or the j associates, 
0 for the zero i, Ё and ik associates. 


ving the jth whole-plot 


A tym} 
Ypical element of q is given by: 
п, +81 e 


n square 


(2) Whole-plot treatments in а Ux t lati 
lot treatments ате allotted at random, and 


Consid 
der its ts AMD. E Je plot. The analysis of variance 


urt, len 

"i т 8 split-plot treatments be allotted within each who! ne 
nin Table т. As before there аге 0570 error terms to be considerec. 
Table 7. Anal ysis of variance for а split-plot design with whole-plot treatments 
dina latin square 


^ arrange 
Sum of squares 
Source of variation T ем 
| (8—0) 
ы адмның rows p n a o 
а columns ( ей м n . 
3etween whole-plot treatments (A) ( I w- ke 6120) 
мый 0 6-2 
(в) B | 
| mui т-0) 
Total " " ( 
NEP а ДӘРЕ аны 
mem и. и 
Ап na split-plot treatments (B) m 4-4) i mM 
E interaction dp | | 
“rror (En) й 
Em dut pcr (0-0) 
Total ри қ 
ares of whole plots, С = cor- 


8- whole-plot treat- 


Тер. 16; 
is 2 Шы А ү = ares; Е 
men, а for Pitas sum of squares of split-plot*: p = column sum, pr treatment combination sum of 
lug, Pm of A vado бду p et sum of square? pe 
8. uares, Т = split-plot treatmen 
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We denote an observation on each split-plot by Хи (ij, = 1, 2, зақ ! = 1,2, Р Ee 
where i denotes the row, j the column, kthe whole-plot treatment associated with the plot 9» 
and 1 the split-plot treatment. Let p split-plots be missing. . m 

The following associates only exist due to the restrictions on a latin-square desig 

i,j, k,l, il, jl, kl, ijk. , 
y c If т A minimize the whole-plot error we ignore the available observations 
and estimate new whole-plot values using the formula for a latin-square design. 
Case 2. Let W;; = total of the available observations on the ijth plot, nai 
Ти = total of the available observations receiving the kth whole-plot tre? 
ment and the lth split-plot treatment, lot 
5, = total of the available observations receiving the kth whole-P 
treatment. 
Тһе rules which determine the elements of А are: 
(i) (2—1) (s— 1) for the missing observation under consideration, 
(ii) (1—t) for the ijk associates, 
(iii) (1—3) for the kl associates, 
(iv) 1 for the k associates, 
(v) 0 for the zero, i, j, l, il and jl associates. 
А typical element of q is given by tW;j + 515, — Sp- 


The fact that the coefficient of some associates is zero in these split-plot designs allows e 
problem of solving the equations to be simplified in many cases. If the matrix А 18 writte 
down ordering the elements in groups according to the j subscript for the randomized blon 
design and the k subscript for the latin-square design a matrix of the form (7)is obtained. " 
consists of submatrices along the main diagonal, one corresponding to each j ог k, and Ze n 
elsewhere. As these submatrices may be separately inverted to give the required pue 
considerable reduction in the labour of computation results. Further, in many simple ler 
the missing observations may be fitted quite independently using the standard formula 9 
one missing observation. The independence of missing values within whole plots in t pui 
designs has been pointed out previously by Pearce (1953). If an entire whole plot is missi"? 
the х t submatrix will have (r — 1) (t — 1) along the main diagonal and (1 — r) elsewhere. ^^. 
sum of a row is zero, and thus the submatrix is singular and there is no straightforW?" 
solution to the equations. b 

Example 2. Table 8 shows the results of a split-plot experiment where the whole PI? 
treatments are arranged in randomized blocks (r = 4; t = 5: s = 2). Missing values hav 
been artificially chosen to provide the example. Table 9 is prepared from the data of Table Dn 
in exactly the same way as described in Example 1, except that the subscripts have pes 
ordered using the middle (7) subscript. From the matrix A it is seen that the estimatio” $ 
2121 апа Tox is independent of the estimation of 2159 апа т). Thus instead of inverti”: 


a 4x 4 matrix we invert two 2 x 2 matrices, considerably simplifying the problem. 
The estimates are given by 


deu 1 ^4 |5 


and (іі = us 3 4 2-56 
Togad 1 3 al 


Thus 2% = 1:19; 255; = 1-19; 2, = 1-43; Тоза = 1-73. 
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Table 8 
. The wet Ч : * 
defined dea weight (4 mg (x) of embryonic chick tibiae after cultivation on а chemically 
еб нн ium containing different concentrations of glucose or mannose. Treatment 
ire placed in parentheses. The variate analysed is log 10% 


сана || 


— 
Whole-plot treatment 
Block Split-plot Hexose concentration (mg./ml.) 
treatment 
| | 
E ова) | 190% | 20 | 409 8-0 (5) 
қ — — ne À 
Glucose (1) жа бы Ж. 1-39 1-51 
| Mannose (2) 08 | 15 |= 1-43) 142 1-51 
IT 
Glucose (1) L06 | 1-22 1-63 1-54 1.37 
Mannose (2) 1-09 (ша 1:19) | (=: = 1-73) 1-47 1:44 
In 
Glucose (1) 0-97 1.34 1-44 1:66 1:59 
| Mannose (2) 0-90 116 144 1:48 1-50 
ТУ | 
Glucose (1) | | f 1-16 1:50 1-48 
1:09 1:21 
Mannose (2) 1.07 | 1.32 1:36 1-41 1:41 
| 
Table 9. Calculation of ^ and q 
q 
Subscripts A ПЕ ағы 
121 292 132 232 Wa I. . 5i 
(Же. | ee Лайс с, ылғы 
191 cx. 7 am 7-40 
à й 3-77 ' 
222 3 1 0 0 ЕК 3-63 7-40 
132 1 3 0 0 133 № 280 5:36 
239 0 | 0 3 -1 % 2:80 8-86 
К. o | o 1 3 p 
ps 
s= 2 —1 
Бн ‚ eat 
1 
gpRVATIONS ; 
үш. Maxxp-U? 0% only know their total. These 


Meti 
Г ow | + we d 
the Own Р identity of observations may be lost 8° pond Mr жу 2А 5 
аа ysis mixed-up values. Bose & Mahalanobis designs where some observations are 
e of randomized block and latin-squ? e = ve teh Oe re VERE 
ay 


Ne, Up, " 

А Squa; nd they have shown that esti mates m | 

) : ау occur We шау relate this to 
m p Wi | 

= ane in| observations ar 


th, e 
e e | 
eng vious шш case both missing and mixed-U nis : 
Conn. > but үш by supposing that some of ШЕ p pA ka eae? 
4 hat certain of them have een mixe 
Y c relations of the form: m 


i wey Did 
Leydig Xi = Ш (% =l, 9, + p 
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i issi 7, ате 
he coefficients of (9) are either 0 ог 1. For example, if p = 5, апа the missing values " 
co | | was ү 
= E жә, X43 ANd 210, with >; and x, and also x, and туу mixed up and 2,, entirely missing 
Lo, Tys Тө» 413 , 
the relations: 
uve 0.2, -- 1.25 4 1l.my-- 0.245 0.9 = Uy 
and 1.2% +0.27 +0.29 +0. £13 51.4 = us. 


НЕШЕ cor mean 
То obtain estimates for the missing and mixed-up values we minimize the error s 
square subject to the conditional equations (9). Although the coefficients of (9) are iym 
to 0 and 1 to deal with the case of mixed-up values, this restriction is not essential in 


following discussion. The problem will be treated generally in matrix notation. 
Equation (8) may be written: 


Bx-u 00 


it 
: В : s А . : eum 
where B is a p x c matrix of coefficients; each column of this matrix will contain on 


Р uis 
element and (c— 1) zero elements, x is a column vector of the p values to be estimated, 
a column vector of c mixed-up totals. 


If we introduce a column vector (2^/№ 


А agrang? 
) of c undetermined multipliers the Lagra"é 
equations are given by 


9 р 
ax Uns тә ...,2,)) + 2A (Bx —u) = О. 
Using the notation of equation (1) we find that 


2/N(x'A—q')+2/N2'B = О, 


and after transposition and rearrangement ) 
11 
Ах + B'A =q. ( 


The values of the р missing values which minimize the error 


vo py th? 
mean square are given АР, 
simultaneous solution of (10) and (11) 


H 1 ! 
- These two matrix equations may be combined 408 


the solution of which is given by: 


Ш-БЕГЕ. 


3 
1 
А [o 
0 
1 
B [5 
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The h 1 
nt B A а 
е missing observations are given by 


Түз 1 0 -1 0 3 —1 4-74 
"ons 0 1 0-1-1 3| |474 
walaza 2E 0 1 2-56 
tue | * 0 -1 0 1 1 3 3.76 
4 4-1 3 1 -8 O| |290 
25 -1 3 1 з 0-8 2:83 
Thus Ті = 1-14; 
1 } Жээ» = 1-16; іе = 1:36; as = 1:67. 


І W: 
s on the draft of this paper.* 


ish 
to thank Dr D. R. Cox for helpful comment: 
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› and d; 
discusses the analysis of those сазе 
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NOTE ON A THREE-DECISION TEST FOR COMPARING TWO 
BINOMIAL POPULATIONS 


Ву J. H. DARWIN 


Department of Scientific and Industrial Research, Wellington, 
New Zealand 


l. INTRODUCTION AND SUMMARY 


Suppose that in each of two processes there is a probability of an event happening and it " 
desired to compare these probabilities. Pairs of trials of these processes may be made o 
sequence and a test made on the results. Тһеге may be four possible outcomes of each em 
trials—the event may happen in both processes, in neither, or in only one. If the result of re 
test is to be that just one of the processes will be used in the future the first two outcomes к 
irrelevant. If the result of the test is to be that just one of the processes will be used in ar 
future ifit is markedly superior to the other, but that either may be used if they do not appe 
to differ importantly then all outcomes are relevant. atly 
In this paper we produce a test for this latter case. Another test of this kind has an me 
been described by Armitage (1957). He does not employ pairs of trials having the МƏ 
outcome and consequently has a different form of test. Both his and our tests are trunc? 
in the sense that a maximum number of pairs of trials is needed for each. uir? 
When the probability of the same result for each process is well known, our test will rd od y 
a smaller maximum number of pairs of trials for the same power, and the same probab! eri 
of rejecting when true the null hypothesis that the processes are equally effective. Annum 
cal example of this is given. 5618 
When this probability of the same result for each process is not known Armitage's je e 
preferable since it does not depend on a knowledge of this probability. When, howeve^ jt 
tests are being used to compare a control process with a new process our test which, Л 
construction assumes some value of the probability of the same result, will be affected 9 
favourably should the control value of this probability depart from the assumed salu n 
The strongest connexion between our test and Armitage's is that the boundarie? wwe 
determine when our test stops are a special case of the boundaries required by his test. ge? 
use exact matrix theory to find the probabilities of the boundaries being crossed, and he" a 
approximate normal theory. For some comparative examples we calculated, the E ab 
bility given by this approximate theory differed from those 


given by the exact theory 
most 0-0015. pe 
+ а 0 
We also give an approximate set of power contours which enable tests of our tyP® : os? 
constructed for any value of the power and of the 


es 1 
probability of rejecting the null hypo’ 


when it is true. 


2. DERIVATION ОЕ THE TEST 

Suppose the probabilities of the events hap 

Then in a pair of trials the probability 
in the second is p,(1— рь) = лу, say. 


pening are р, and p, for the two process, pot 
of the event happening in the first process 


The probability of the same result in the two trials ig Py Pot(L ваў 


= р!) (1-22) = ™ 
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The probability of 
probability of the event happening in the second process but not in the first is 


cte = пз, Say. 

Ку a variable y taking values 1, 0 and 

of y, We ey a -- Vm benn made in sequence and y(m) is the sum of the m values 

where n is agine y(m) drawn on à graph as a function of m. Suppose у(0) = (2+1) 

Then the probability that y(m) has the value Y is Py, m 
(1) 


s that to use in the future. 


—1 for these three cases, respectively. 


an odd integer, and suppose 
Ify(m) crosse у Ру, ть = m, Py a n Тү, mt Т Рузът 

у(т) ы” "д = 0 we shall suppose the first process is accepted a 
Neither of the. іні n+l we shall suppose the second process is accepted. Ify(m) has crossed 
Tegarded as c - oundaries when M pairs of trials have been made the processes will be 
Values of M qually acceptable for future use and the sequence stopped. Wenow wish to find 
accepted la i for which there is a high probability of a markedly superior process being 
Same Wire b high probability of either process being permitted in the future if there is the 
System (1) Шу of the event happening in each process. To find these values we solve the 
with P, — 0 for all integers 4 and b greater than 0; i.e. for ym) 


с 
Onfined betw dcum n+b,m Е 1 
een Land n. The Py, m satisfy the matrix equation 


\ 


п Tg 
пу То 7з р | 
Рина = | | | ш 
Ty Ta Ts 
m Т 
Where P; = Р.Р, Says 
18 the у 
Em m е vector of the Py, m i: | 
Pose | D, — АЈ is called D. (д). Then D,,(A) satisfies the difference equation 
т 7 
fh solution рд) = (п =) Ра)" эйе à 
- 2 — 477, TN: ien 
2.0) = Gr, AÀ MEME A) 173} d 
a cu T a annal 
Y fo 
llows that, the latent roots of Dn are 
(3) 


тт 212,5? 
л, ттт) тт 

ut prf 0]. 
responding to A, 5 


% 
t 
hen follows th. эш аў 

at РА) = (т mg) sin Ul atisfy the equation 


e el em 
Y ents of the latent column vector Cr °° 
tth фае 
з А); + 113 0)+2 ; 
the Ph follow пуху + (па нат? atf ра 
Е Sleme that the jth element of б, proportion? nding to A, 5 P^ 
ылы Met. of the latent row vector В, corres 
Ma, есе TT. 
y та, 
Write lresolution of the matrix D, is then P^? 
Dg = X OR, 


and, similarly, 
portional to 


iple (see 6:5: Bartlett 1955, P- 26) and we 
81 is 


(4) 
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Ў З en 
The probability of y(m) crossing Y = 0 up to the time when M pairs of trials have be 
made, if y (m) has previously been between 1 and n, is 


M-1 п " 
У Pints = пз У Y APC. В, инь (Cr R,) 
m=0 m т=1 
2 m) 0-D n рдм. . orm (8) 
= —— п. — У - sin — sin. 
nmel "Um r=1 1—А,„ n+l 2 


д hen 
Similarly, the probability of y(m) crossing the boundary Y = n+1 up to the time W 


М pairs of trials have been made, if y(m) has previously been between 1 and n, is 


2 mM n 1. XM оо (6) 
AX — У —S sin—— sin—. 
өтт) Ха sin- сүзіп 
: А aving 
The probability of y(m) crossing the line т = M between У = 1 and Y = n without Ва 
previously crossed either Y = 0 or Y = n+1is 
9 7, 1-0 m, E-D) n E emo к (1) 
m m^) Ran ны. 


3. CALCULATIONS AND DISCUSSION 
Values of M and n given in Table 1 have been found to 


(i) For some 7, and for a value of п1/пз greater than 1 
to 0-95 or 0-80. 


1 
eque 
le 8 


(8) 


satisfy these requirements 
; the probability (6) was mac 


г 

4 | (8) 

(ii) For the same T, and for т; = ту the probability (7) was made equal to 0:95. rene? 

That is, the probability of accepting, when true, the null hypothesis of no nmn of 
between the processes, is 0-95, and the power of th 95 


€ test when 7, and л, differ is either 


Table 1. Distance between acceptance limit 


8 (n) and number of trials (M) 
which will ensure the power indicated, 


at the 5% significance level m. 
Power |п/пз(=5)=1-2| 14 16 | 1:8 2290 2.5 
| | ғ 
| 31 
7, = 0:25 0-95 n 181 99 71 57 49 37 68 
M | 2200 | 656 | 340 | 2200 162 95 26 
0-80 n 141 77 55 45 (43) 87 29 43 
M |1336 | 399 | 206 | 136 (131) 96 59 гет 
Td [эё 
31 
л, = 0-50 0-95 n 181 99 71 57 49 37 | 109 
M | 3301 | 986 | 511 | 331 244 | 144 25 
0-80 n 141 77 55 45 37 29 65 
M |206 | 600 | 311 | sos 145 89 ЕС. 
н - " 
7, = 0-75 0-95 n 181 99 71 57 49 37 | 909 
M |6606 |1972 | 1025 | 6% 491 | 290 25 
0-80 n 141 77 55 45 37 29 | 132 
M | 4018 |120 | өз | ајә | 292 | 119 
L ; | 
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0-80. Тһ , 
- Lhe conditi Ss des Р s 
was 1 to 3. ditions (8) determine unique values of М and n. The range of 7/73 considered 
That у 

at value — А " 
(i) and (ii) ei 5 п was chosen for which the difference between the values of M for which 
of M was e ) were satisfied was a minimum. Then for the sake of definiteness that value 
one place s р for which the power was closest to 0-95 or 0-80. The two values of n given at 

ы. 1 ар ` а : . 

nder 7,/74 = 1-8 indicate that n = 43, М = 131 is the better choice, but n = 45, 


Т ы 190 Менімен ouni ee 
en put in for comparison with the lower figures in this column. 
since in all cases at most six 


n 
Th 
е caleulati 
and usuall VE were not as laborious as might be imagined, 
ally three ter Mee Bro FEN = = 
y three terms of the series in (7) had to be taken, and always for ту > 75, (5) was 


"ері 
n (6) 
m (^ ta-o sin лп D) sin do _ Ен " 
n+l Иль E ТЕ, по при" 


when M is infinite. We used 


Since +1: 
nce this is the 
were keeping. 


US as ap m) crossing Y = "+ 1 
r 


known probability of ?( 
umbers of figures we 


artis қ, 
have. е, ien check on the sufficiency of the n | | 
ате good approximations available to the probabilities (6) and (7). First, there is 


Пе n 

can Mo approximation. In this y(J/) for large M is taken as normally distributed with 

Oundary Y. лу) and variance M(1—75— (1^ та)». Тһе probability of y(m) crossing 

Ne pati = n+ 1 before M pairs of trials have been made is found (after consideration о 
hs that reach т = M with Y less than п+1 but which have previously crossed 


9+1) be 

1-р| Қаза -Қа--1)-(т =") | 
ACC = —3.) J 7 = RUIT a сикы сл 

| L) = (пута) М | -— Е +1) (т Д ат E = ЙГ (10) 


Wh п-т, п) п-т") 
ere 
hi F(x) -f е4: da] (2m (11) 
S re. А Е 
Powers lt 13 quoted by Armitage (1957) from work by Bartlett рий. Vesp = 
Щщ К gave for the values of n and M given by the first part of e га и min 
the 5 With the powers given by the exact quantity (6) and in none 0 


eren 
Се as much as 0-0015. —" dering (7) directly. Because of 
n by 


Noth, 
9 er iugis 
(8) We ma, approach to an approximation 18 givel 
Y write (7) in the form 
Р ‚тт 
A ss АМ n LL sing (12) 


Now sit i= 


Sup 
pos 
€ for convenience we denote ml Е" 


% 1-3, (n) (17 iad 


Tg 
Mal] 
ғ 
and large n we шау write this аз 


expe 248 2 J8 т2л? 

12111 = E A. 

É ( g) e ies 2 ) пав e m 
= TM, ә [9 эл? me гез алі 
(| EE 34810 +5) 
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11 


5 the 
= + -8660 to 0-9959. For 
he ran i E 2,/8/(1--6) goes from 0:866 НЕ. 
Е S considered, З to 1-2, 1—2 J ite. 
or the ж | " IE this meant that the last factor of (14) Ес с px к. 
мен; se ss W i lues of r less than 7 so that r had a range all 
. Weonly had to consider values | ; cue. d 
ind н а = iia Vu we may write the denominator of (12) approximately 
independe А as 


6 77 sin (177) (19 


А RT - 24S +5у+ pens) We shall us 
2 р z. We 
This exhibits the denominator as a function of (n + 1)? (1 — 2 4S[(1 4- S)) only 
this approximation below. Again, M log A, for low r is 


approximately ; 
ren? 2 MA (n, т”) | (16) 
Н ger тат 
= -Mü-7(1-248/(14-8))— $M (1—7,)? (1-2 48[(1 4.8)? an 
EL EE sir mL ENG 8). 
2(n+1) 1+8 abinatio" 
The two leading terms of (17) each have M and Тә occurring only in the com 


zl 
icular if 8 = 
M(1—7,). The remaining terms are proportional to 1 —2 /5/(1+ 8). In particular 
1-24S[(14-8) = 0, and the approximation given by t 
We can in fact write approximately for (7) 


4 е% 6-%/ (18) 
e a a a ah 
7T 3 5 


s 000: 
he leading terms is especially g 


where f = 7*M (1 —5)](n 4- 1). 
Suppose now we have found 
тә, Бау 0-75, and for S not equ 


The value of f making (18) equal to 0-95 is a of 
М and n for the test to havea given power for a hig 


л 
s er the 
al to one. Then M log A,(0-75) огт, = 0-75 will ares: 
M, log A, (0-25) for Т = 0:25 and M, = 1M because both the second-order approxi tel 
(17) have a negative sign, However (M,—1 


sme 
)log A,(0-25) — M, log A,(0-25) is approXi? o 
97190 — 2/81 +8) 127075 JS ( +8) ( 


is of the 9^ 
^ + 1j and this in our examples is of d 
order as the second t 
balance the negative terms, This 


e 
erms in (17). In fact a drop of only one or two from M, was "i je t 
explains why in Table 1, for fixed S and n we 97 
achieve the same power when M(1— т») is kept approximately constant. 
We may write the term 


rn? M (1—,) 248 
9(®+1# 134 


) 

o 

?75М(1-л,) 248 

Е (17 mi. oh. zee 

of (17) as ТИ ( ( 7 3))- int 

The term in 1— 2/8/(1+8) of (19) is of the same order as the second-order term in! ate 

now appears from (15), (17) and (19) that the power (the complement of (7) is appro* 

a function of the two quantities 20) 
М(1-т,) | 


NCESTÀ and (n4 1)? (1 —-248[(1 +5)). 
The second-order term in ( 


a 
17) and those neglected in (15) offset each other to 50129 К wis? 
If they are neglected we may, following (20), solve approximately for the M and ? T 
by (8), the equations 

M(1—7, 

СЕЗУ — constant, 


(n+1)?(1 —2,/8/(1 +8)) = constant, 
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For the values 
тесі. 2 s of n and S from Table 1 for а power 3 : 2 
ra I +5)) takes values 137-19, ud. re bs Mer n 
Et m= dl P of 0-80 the values are 90-57, 86-86, 82-58, 38-21, 84-65, 85-10 and 33:50 
0.0510, 0-082 "s s Lagos of 0-95 and for increasing т, M(1—73)[(n-- 1? takes ud 
аге 0-0488, pcs p? | POMA 0-0494, 0-0493 and 0:0499. For a power of 0:80 the values 
_ The variations 7, 0-0506, 0-0487, 0:0497, 0-0494 and 0-0498. 
In arriving at (20 in these quantities are caused partly by the influence of the term neglected 
odd. This mean ), but mainly by the fact that M and n have to be integers and n has to be 
exactly those nr tek Bal powers for the values of n and M given in the table are never 
decimals фе ated: They diverge from them by up to four or five in the third place of 
We кимеу) w n and less for large n. 
starting " ind expect (20) to hold if S were to increase much beyond 3, since 921/8/(1--8)ів 
One, diverge too far from one for 1 — 2//8/(1-- S)to be considered small compared with 
ITH ARMITAGE'S TEST 
regardless of the value of тз. Our test 
If our test is designed for a given 


xpected that our test will 
numerical 


Armitage’s t 4. COMPARISON W 
has a ditum est has a given power for any fixed 7/73; 
Value of д, poe for a fixed п /пз and a given value of л. 
Tequire а, -— this turns out to be the correct value it can be € 
example of ü A maximum number of observations for the same power. Wegivea 
n his T; - А 
a Dower oe 5 is given the maximum number of р 
Comparison E and a probability 0:05 of rejection 
Scarded with our test we divide this maximum numbe 
"umber p pairs of trials in which the processes gave equal r 
Constant 5 = value of M corresponding to this value of 7% 
Уробһеві 1-27. This represents, for this power and probability of rej 
ті is, the relative total number of trials required by the two tests when the а: 


2186 
Qual to the actual л. 2. 


Upp 
Ose, h р 4 
construction of our 
: ; however, the true value of 72 at assumed i m ком 
he power v ecrease, 8 ег 


86. Then į 
tt wil incr ifthe true value is bigger than t ў pes 
7, varies җы We can make ап estimate of how pig the change n power is likely to be when 
М т» y recalling the approximate result (20). If ake id values of 
Present I map the power function of the test БУ varying d S. We have done this and 

= € result in Fig. 1. From this figure we can get approximate values of M and n for 
Зер ins hich the power takes any value between 0:50 and 0:99 and the probability of 
Fi € of a true null hypothesis lies between and 0:993. 

i if т, is muc 


8. 1 
ag sh 
Sum, 2078 that there may be а mat 
е test if there js no 


nequal results to give 


airs of trials with u 
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EFFECT OF NON-NORMALITY ON THE POWER OF THE 
ANALYSIS OF VARIANCE TEST 


Bv A. B. L. SRIVASTAVA 
Indian Institute of Technology, Kharagpur 


l. INTRODUCTION d 

А number of studies have been made of the effect of non-normality on the test functions use 
for the analysis of variance. The effect on Type I error, i.e. on the distribution under the с 
hypothesis, was studied by Pearson (1931), Geary (1947) and Gayen (1950), and that ү: 
Type П error, i.e. on the power function, has been considered in а study by David & J um 
(1951). Pearson showed that while both ‘between-groups’ and *within-groups' mean square 
in the case ofnon-normal variation, provide unbiased estimates of population variance, P. 
are no longer independently distributed; in fact, their variances and covariance cont! e 
а term in A,.* However, he reached the conclusion that non-normality would not bie 
a serious effect on the distribution of their ratio w, in large samples. Considering the effect T 
kurtosis only, Geary (1947) gave an approximate formula for the probability correct? 
for w, based on the large sample assumption. Gayen (1950) derived the distribution of 10 А 
non-normal populations specified by the first four terms of the Edgeworth series. He "p. 
it to consist; of corrective terms in Ла and A$ in addition:to the normal theory proband 
density function of w. Assuming that the observation т; contains an error term е;; which т. 
a distribution of any form whatever, with all the cumulants existing and varying f 
group to group, David & Johnson (1951) obtained the moments of the distributio? n 
а certain function of the observations which makes the study of power possible in the va 
normal case. They did not, however, make direct use of the distribution of a non-ce? 
variance-ratio as is done in the case of the ‘normal theory’ power function. ij 

Tang (1938) and Patnaik (1949) studied the power of the normal theory Х?- and F He e? 
by deriving the corresponding non-central distributions. The non-central distribution аб of 
when the hypothesis of equal means is not true. The effect of non-normality on the pov 117 
theanalysis of variance test can likewise be studied by investigating thenon-central dist? ab 
tion of the variance-ratio. This has been derived in the present, paper, on the assumption T 
the distribution of the error term is represented by the first four terms of the Еавемо д, 
series. In addition to the normal theory power (Tang, 1938) 
and A$ have been determined. In deriving conclusions we Һа; 
the values of A; and A, within certain limits ( 
Edgeworth series is to represent а positive definite and unimodal frequency functio”: 950) 

We have considered here the simple case of k groups of n observations. Gayen m 
obtained the distribution of the variance-ratio assuming unequal number of obserV? 0? 
in different groups, but we have avoided thi 


of more parameters of non-centrality and would make the results more complicate e 
results obtained here enable one to calc 


ulate the effect; of non-normality on the prob?" | oe 
of the error of second kind, and hence o yo ее 


, the corrective terms i? sonl 
ve keptin view the fact tha” 4b? 
Barton & Dennis, 1952) can be permitte 


variables. 
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the 
power aris 
rd ises from the fact that i i 
t арреа; act that it contains confluent hypergeom i i 
pear to have been tabulated so far in detail italy а 


2. р 
=. IST. d 
TRIBUTION OF THE NON-CENTRAL VARIANCE-RATIO 
тј = 12% k), and suppose that 


Let us Б 
consider k groups of observations ау; (i = 1, 2y 
e a is the grand m P К мүм NM 
> у= 0 and e; Mine B; the deviation of the jth group from the grand mean so that 
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gher order cumulants to be zero, we can express the distribution of the 


standardi 
ardized vari 
varia af = 
ble z,;( = eio). by the first four terms of the Edgeworth series as 


^ 2 
Where бү) fhe) = gle) —229(0)+ 540%) +23 ge), 
2) 15 the 2 
ow ‹ beteen, standard normal function, and Qo(z) its vyth derivative. 

-groups' and * within-groups' sums of squares can be written as 

зы а-а её” 
and gu cm -naX (2; 290^ where 6; = В;/0 
І E 


(3) 
(4) 


kon k mn Xam 
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jail 
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(k-1) Y] 


such t 


E In thean 
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E а ези allj) in order to determine t 
herwise, 1 rg value а. We reject the 
7 is not true, we can determine 
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pr ili 
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alysi А 
"dde of variance, as is known, W 
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of › Ш orde: F 
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ang Sy = Saute So; = Ble ) (6) 
k Le 6 
80 ry = Жы ср (6) 
that x -Х 4% = 58 А луд j 
ing Sing the į з— ХШ, where N = nk. паб, by S35 — 19 
joint distribution of Sy; and 55; which 18 obtained by replacing 91) үрү н) 
1) 2j teristic function of Xy X2 and Y has been 


oint charac 


deri “result (2. 
Ved ag (2-11) of Gayen (1949), thej 
zit A/Z 2it)) ris E {З3Атув + 2T 22} 


$t 


bis, и) " 
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iti iy 8) 

разв "тп Sn | 
k k Е (9) 

х=}, pena and кенжар ; 


© = BÀ 
and where Фе(!,, fg, u) denotes Gayen's (1950) result (2-6)ї, in which we have taken k’? =} 
Obtaining the joint frequency function of Х,, X, and Y from (7) by Fourier's inversio! 


theorem, and integrating out X,, we find the joint frequency function of X and Y can be 
written in the form 


GX, Y) = C, X, Y) Ag, С.Х, Y) Augu С.Х, У)+А aX, Y) (10) 


the typical term in it being (У n) VG n), all | 
um $(Y,n) = Yinigcir (12) 
маги) 
А Жо ИЕ | 
ВЕ YX, r) =° а eo Хы " 


j= 2 (Ary jt 


We can now deduce the frequency function of w = [(N — k) X]/[(e— 1) Y] from (10) in the 


form x 
g(w) = go(w) + Asa (w) + Aaga Qo) + NBG ла (и). 
Using p(w; r4, т) to denote 
$ T ebay Даун Hi winti-1 (19 
ол BU, +}, bra) vp TE jue 
+ M 
where v, = k—1, >з = N — k are the degrees of freedom of X and Y, we find 6) 
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ga) = &ups(w; Vi, Vg) i 
1 
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Where 
alw: r.r) = . Р 
Ре; ry r3) = p(w; r,+ 2r, 79) —тр(%; 7+ 2r—2,7,) 


r(r — 1) 
3 p(w; ri + 2r—4, rj) at (— 1) p(w; 7,7%). 


(20) 
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ase (a) the value of и will be ze 


For a fixed A, in the c 
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Table 2. Giving the values of В, (шу), Py (а) and Pyx(wo) (of Table 1) 
in the cases (а), (5,) and (ba) 
и = 4,0, = 10 и = 4,9, = 20 
Фф Саве Е 
Py (wo) Py (и) Руи) Pa (wo) Py () 
0-0 0 0-00376 — 0-00429 0 0-00241 
L0 | (а) 0 — 0:01289 0.00476 0 — 0.01143 
(b) £0-00465 | . 0.01313 0-00479 + 0-00604 —0-01177 
L5 | (a) 0 — 0:01968 — 0:00345 0 — 0-009851 
(b) = 000350 | 0.02050 | _ 0-00415 + 0-00094 — 0:01064 
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30 | (a) 0 0-00520 0-00068 — = 
(b) F 000279 0-00561 0-00068 = = 
eel sl 
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It is again seen that when Аз = 0 there is practically no difference between the values of 
and (b). When A; = 0, in the three situations (а), (b,) and (bs) that 
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hat of kurtosis Which is, in general, high. Тһе presence of a f: 


В š қ M : yower 
as Is not uncommon in practice, leads to a noticeable change in the ] 
curve particularly in the case of small 


respect of kurtosis (say, of the order A, = 
viation in the power, 


beginning (for exam 
when y, = 4, v = 10 (Fig. 1)), 


s qo 
his regard cannot, however, 


which the values of A, and Аа а 
ary. But on the whole it may be said that from th 


normality on Power will not be of much consequence 


practical point of view 


» the effect of non- 
in the cage of near-no 


тта] Populations, 
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THE z-TEST AND SYMMETRICALLY DISTRIBUTED 
RANDOM VARIABLES 


Bv F. N. DAVID* 
University College London and Statistical Laboratory, University of California, Berkeley 
Some years ago in two papers (David & Johnson 19516, 1952) the exact cumulants and 
cross-cumulants of the distributions of the sums of squares usually associated as test criteria 
for the general linear hypothesis were given under extremely wide assumptions. It had been 
the intention, at this stage, to substitute in the given formulae the expressions associated 
with each of the analysis t-ups and thus, by taking appropriate numerical 
values, to obtain some ide and without unequal group 
frequencies, on the F-test as com monly applied. The arithmetic, however, proved formidable 
and an algebraic alternative Was sought for. This algebraic alternative has proved of use in 
associated with the genera esis. It is proposed therefore to 


l linear hypoth 
o illustrate its use on the F-test of the one-way classification, the 
usly been discussed by Pearson (1931), David & Johnson 
hoped to present similar formulae using the cumulants 
s of squares in à further publication. 


ok observations with 


of variance ве 
a of the effect of non-normality, with 


other problems not 
summarize it here and ti 
non-normality of which has previo 
(19514), and Gayen (1950). It is 
for the general linear hypothesis sum 
Assume k groups containing ny %2 
ў И 
t=1 
-way classificatio 
] колм 
k k m EE Ше 
к ge P+ 2 eae) = (zy; — &.J*- 
д вии. -2.) +2, 4 eu 2 à à ^ as А 
Usuall j ach to follow the same normal distribution. We shall assume 
that Arva aiii - «i bution. There are no real difficulties introduced by 
i : that x has а different 
E generalizing ие sina к f method and the symmetrical 
asymmetrical distribution. The purp rrt шиша. 24 


distribution leads to shortening 


n leads to the identity 


In commonly used notation, the one 


5 zy 
zy and Seo= 4 4 (xi; 7 9? 
and S, from those already 


then we may write down the cumul 


given by David & Johnso 
е (8;) == " 


We define =ч 
(S189) 


ч = п\ 
к ее ү 


т 


and 
Vac = (n— 0) 
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It will be noticed that Vote = 1 when n, = nfk,t = 1,2, ...,k, From the printed cumulants we 
get, putting the odd gammas zero, and rearranging 


- y Va ZR Varo 
К(5, 5) = 7, KS) = Vas wu 


2] 


"WC 
M EI + I 


2 
T 
1 


Л 


Ve Vow 
K(S}) = T + dy, Nd 


К (S28, рн Vias Va 
(Si A ш 


F, V, 
K S 52 жемеу 221 111 
(8,83) Yam TRE қ 
[А о, 8 
K(88) «y, 19), Vas 8 
ML + nfa fi 
У y 327 7 
К(81) = Ys S + 24у, = уБ 4 144у, More, 48 
пЗ KZA Үй nf, Уз; ? +7 " 
K(S18,) = DUM 12у, Мө» +73 LU 245 Ин 
т n?f, nf, “ Уг» B 


K(Si82) = fg BEA Lm 2 (s ау 16y, V, 
y 2 211 Ya" 
S ‘nfi f. з nf, f, aci) Mo 


"ЛАС 
KID = уйы із; Ма, „ет ор y 
ur Vang + ys Done PETUNT 


LRL EE З 


қ 
—210 А 48 
nf nfa пра +уз) +144, 04. 


nfs AU 


к, 
2 +++) Ky, ... and the normal curve 


Mean and vari hab 
: ariane wi veth 
nit norma] deviate E S ABS таш 

> 


given by Say, of the percentage point, is 
2n~ Ki _ ё = (£2 1 1 
Жын Е 9 дар c. Е 
where 1 


= м 
T d. (ғ > 9). 
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Provided the moments of z are not greatly affected by non-normality or by unequal group 
a Fisher-Cornish expansion should not be adequate in the 


frequencies there is no reason why 
at the departures are of such order for this to 


non-normal case. We shall assume therefore th: 
be true. If Ly. Ly. ... refer to non-normal z, z4 (say), then let 


=, ac hes = Ка (ғ > 9). 


КоА Т, ‚Ж : 
(ку)! ý Kex (Kax)? 


ке 


—Cornish expansion is 


In this case the Fisher 


(2&8 — 5E) +--+ 


Lj£ - 1) + Mal t dI 739.7 5 


Fe 
— 58) [Spe а) 


d from the differentiation of the cumulant 


can be foun 
ple. Wishart (1947)) 


If desired the cumulants of zx 
generating function (see, for exam 


4-4 FW 
= og (5527) 08 (е) nec 
| | it — 2 „—2 В 
—log (^ at a2] +log (EF) + Нов (3/2), 


ut this is not really necessary for the 
for the polygamma f lated percentage point of z, given, for instance, in 
difference. We also note that Zx is the tabula 2 obtained provided the differences of the 
Fisl T і j imati c, is of order n-* 
her and Yates’ Tables. 24 We Е of approximation. Kay i ern 
Cumulants of я апа гу ате known to 2 sufficient order oF ; SR equal. 
a? N 
and 1, and Г, are of order тізу when 


unctions are well tabulated, b 


log (Sil) —log (8) 


By definition di иа 
5,—6(85) 
9-69) _log (m қ 
or 32 = log ( T S) | 
i d we have 
i d Ss; respectively: an 
№ rr ta adardized, S, and Өз 
rite д, and 6, for reduced, star siti 


2) 4 1(6#— 63)-4 
MM id | 
id à yhich were given earlier. Taking expecta- 


f whi 1 
8. © cumulants 0, аге put equal to zero are 
ee S EO quanti е te remainin m ! 
8 and remembering th® f normal 2 we 
t 5 value of ? 
hose belonging to twice pene 373/72, - Và + Tas e +... 
Loe шю ол \ Л 


2% (Voss 7 Jo) — 8 n? 


- (уа Vas * 6n? 
4n 
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d 2. Following the usual procedure we obtain for the higher cumulants, 
to order л. 


А — (ess — Vion — Vans + Vion) 
Kea fav = dg o in + Vas) + | 22 (боз 122 — 521 + 1520 


2 е 72 
+5 (Vae + $V 349 — 2a W- 2E = Vin) 
n 


0 (sz) + Vaio (+z) 


А 3% 219 — 2V) tees 
Кза — Kay = 2%, (Voss — ЗИ,» + 3¥ 201 — Ио) — 8 x (Vor2 — Varo) (Vos + Faro n) 


Ka — Kay = id (Ks, — “а 615, — 403, + Va) 


->---- Т" 
€——————— 


si ым (б э(—5Й 5+ 69, — Voro) + (= БИ + 69, — Va) 
р р ay 
аа — Зо 601) + (9 — 395, — бИ) 
3 
HS (572+ 573 0-+273 +3 У (Иә + Va) 

7 Va (Ра + Vio) + 6И Varo Mons) 


dere G2) (2-2) n ж) 


гз 3 1 қ 20 
F га ( К т (ois -ha? @ 2 (Varo тз Vin)? TE (4% E 6Й F aer) 
аз (вл, — 140, + 3V, 1) +2 (ais = aot £): 
Js 320 221 BER fi fe 
The second and third cumulants are of order n= and the fourth is of order n-%, 
Ifthe numbers inth 


"S 
€ groups are all equal, the V's are all unity and considerable simplific 
tion results, We have 


У [11 
eH) 
Ж! 
Koa — Ком = (7+5) os 
1 Je 
Kg4 — Kay = 0+... 


(-5-5,% " i X 8 lm ж p 
eR R RI "Rz )+ 


then 
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We take the expression for Vase» substitute for n; and expand to get 


Қы-іт?” Есі ack ар Жан, жез A 
4 f Mn R CH ар, 
Із i 
HPNOILOUS — aC, eC Em «Q0, — mr E s Оз Ero, ix 
мы (еҙ vi АЙ "n un C СЕ 
NS! а 59 —в+“ a = nf 
atag A s M 
ыа 
For example Vaa-l7 а, Љ + аб j^ ii 
If k = 5 and т = 20 this gives 03 
5 
ha = -ig та”) 


uencies the first term involving v 
pstitute for Vave and collect terms 
as described by their variance, to 


which suggests that for all but widely different group frequ 
will be adequate, and this is so in fact. We therefore su 
which enables the effect of the unequal group frequencies. 


be seen immediately. — 
y l 1 
Let А, = (5-3) 


nr 


Then on substitution for ya, We have 
n ,U 2 
ыда? yaba 2 Ё (y, Аа Аа x im 
Куд Ki = -5 MU RET угда 0378 е а 


фа ы 


Koa — Кам = 
aN 4n? 


he: | By BR зук шәт 
-pmke 4 
ok? 


" 2 pA ay 
-i um 272 


Кад — Kan = 
зуй 4 оу 
Ku ky = = Ві? 
4 m n 6 Зв 4 UP y 5, 
ТГ. (ea dU ARP RA 
and | т 5% бт 
ЖЕСІ 


n? 
2 п 4k ps 6 
ИЕ i 
7 т^ 


2/9 
ECCE RR 
ВЛ АМ АЉ ibo groups are equal, the cumulants reduce to 


When v — 0, that is when all the numbers in 


t 
hose previously noted. 
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We consider the case of equal group frequencies first. Refer to equation (1). substitute in 
thecumulant differences and collect terms of like order. The difference between the tabulated 
value of z and that of non-normal z is to order n-? and for k reasonably large 


НЫН ЕРЛЕРІ 


On the whole this correction is not large. For n = 20, k = 5 it is 


— ya 3 0-020 + 3: (£3 — 32) (720-063 + у. 9-010) 
while for n = 20 and k = 10 it is 
=y: 0-015 + J: (£3 — 32) (y20-086 4-у10:013). 


For the sake of illustration consider the rectangle, where y, = — 1-2, y, = 48/7. Тһе altera- 
tions to the tabulated percentage points are as follows 


! 


Degrees of Normal Cor- Normal Cor- | 
freedom 20.05 rection | 20.01 rection | 
—PnA OE ЖИИ c—À — | 
p | 
h=4f= | 0-016 0-794 0-066 
fi = 9% fe = 10 | 


0-011 | 0.799 | 0-072 
| 


The corrections are not, on the whole, large and are unlikely to cause serious error in the 
application of tests of significance. The stability of the 5%, significance level is worthy 9 
note, since it has become apparent in many non-normal investigations. (See, for example; 
Pearson and Merrington's table 42 in Biometrika Tables for Statisticians.) 

Tt will be noticed that while the effect of non-normality with equal groups is to alter the 
mean and variance of normal z by terms of order n= the effect of unequal group frequencies 
s that it is no longer possible to expand 
about moments of 2у since the Edgeworth expansion requires the assumption that the 
difference between the second cumulants of 24 and zy is of order n-2, The simplest thing 
would seem to be therefore to expand the mean and variance of 2, with unequal group 


frequencies (Z4 say) about its own mean and variance, remembering that the cumulants 9 
zy which can be substituted are given by 


2k = log fa — log fı + ($) Vf.) 

kev = ЦЗ)  (— 1) VG (s> 1), 
where У is the digamma function and 8-1 is its (85-1) differential. For moderate dive" 
gencies from the ideal equal group frequency situation, such as envisaged in the expansion? 
for the moments in terms of v given earlier, it is clear from numerical calculation that #16 
effect of these divergencies is to bring the mean and variance оҒ2% back towards those of Zn" 
The greatest contributions to the mean and to the variance sore, jn fact, 


РА 
Kias = Kiy — ys (+=) 5 


EB) B, 


Код» = Ken + Ya ( 
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1% ің Қ i 

$ 8 сл therefore that calculations for the case when the group frequencies are equal 

1.е. Whe — 0, will give a reasona e idea of the mag iitude of the error involve еге 15 
пеп v = 0, will give a rea nable idea of the magnitu ftl rror i lved if th is 


non-normality. 
s been one of method, but E. S. Pearson has pointed 


Тһе purpose of this present note ha 
out to me that Gayen (1950) also considered the F-test in the one-way classification in some 
h the effect on the nominal 


detail. Gayen was concerned in his numerical examples wit! 
5% F-level of non-normality and une qual group f requencies and his results are not therefore 
able with those obtained here. It is, however, possible to use the same tech- 
as has been used above to obtain corrections to 
then for the equal groups case 


directly compar 
nique to obtain corrections to the tail areas 
the percentage point. Thus if g(7) is the normal ordinate at 7, 


the correction to Я 1 
Led 
js к p; dt 
will be found to be, to order n^, 
у (Kaa — Ken) (93 — 39) (Kaa — Fax 
ТІ ЕГЕМЕН “a Г ^ 


For reasonable sized іс we may write this as 
nye 09-30 ® (n (© =) ni (2-7 a) 
000 |а s: B + SCA OA AA A АЛ 
ot appear to depend on the 


he correction does n 
equal to the standardized 


the major part of t | 
for 7 18 


Somewhat surprisingly; 
h these are implicit in 75 


degrees of freedom, althoug 
percentage point of zy; i.e. zy (a) ках 


assumes for the parent population 


The compari i Г ek i 

А parison with Gayen's Wor k is : i а»? 

itself a; ; . im which the terms involving Уз (of the population) and yz ате 
n Edgeworth series in " me arison should only be made inthe 


assu e" ы ч ^ 
med not toexist. It would appear ther “tice definite, i.e. із capable of being 


region in which the parent populat Type VH for my population 

a probability distribution. I have therefo degrees of freedom of 
Е distribution with 104] Let the degrees ot tree om o 

ive the correction to be made to the 0:05 


difficult in that he 


а Pearson 


жы Yo = l and у; = 10. [This jsa t- ere 
he samples be f, = 8, J2 = 12. Gayen’s ta ew | t иче 
“pper level an de a Substitution in formula (3) gives the oig семей 550. ль е 
Corrections are therefore of the same order, and (3) сал be use s 
ual significance levels for reasonable ^. 
а о 


error involved for any of the us 
draft, and Н. Scheffé 


in the original 
emedy the situation. 
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APPROXIMATION TO THE DISTRIBUTION OF SAMPLE SIZE 
FOR SEQUENTIAL TESTS 


I. TESTS OF SIMPLE HYPOTHESES 


By D. H. BHATE 
University College London and. Delhi School of Economics, New Delhi 


l. INTRODUCTION 


E > sis 
A. Wald (1943) developed a sequential probability ratio test for testing a simple hy nr 
against a simple alternative, He gave approximate formulae for the sequential tes tolio 
cedure, its average sample size, and the power of the test. He has also derived the eee же” 
form of the distribution of the sample size. The actual distribution of the decisive sa 


Pup f the 
number (D.s.N.) has proved difficult to determine. Кас (1945) gave the distribution О 
D.S.N. for the test approximatel 


the mean of a Normal distr 
Darling & Siegert (1953) 
approximating the se 
known variance. Bu 


av of 
t in practice it is rather difficult to evaluate the actual probability 
reaching a decision 


зр саат OF 
at a specified value of л. Baker (1950) has studied the distribution 


satisfactory results 


В . ceu- 
of n, because the errors of approximation & 
mulate with n. 


"m 
In this paper we shall derive general formulae for upper and lower bounds for the cumul 
tive distri 


ibution of the D.S.N., and use them to estimate its approximate distribution. 


2. DEVELOPMENT оғ THEORY 

2-1. Let f(x, 0) be the probability densit: 

& parameter 0. A sequentia] procedure, 

alternative H,:0 = 0, (Нұ and H, may be 
follows. 


ы " ion 
Suppose that Ti 5, ...,%; isa sequence of independent Observations and that t;, а functio 
of these 278, is an appropriate test statistic, Th, 


en the sequential test procedure is based ең 
а partition of the range of variation of t; into three mutually exclusive regions C;, A; and #p 
using the following rule: 
(i) ift;eC; take a further obs 
(i) ifte Aj, accept Ны 
(iii) if tj € Ej, reject Н, (i e, accept H,). 
The regions are chosen in Such а way that the prob: 


they are true are approximately æ and B, respectively 
Following this rule a decision is rea 


d 
ched at the ith Stage ift;ec, forj -1,2,..,4-1, pes 
either f; € A; or t;¢ R;. The argument used below to determine the upper and lower bou? 


A n 
y function of a random variable a5 dependi i 
for testing the hypothesis Н,:0 = 0, against d as 
either simple or composite) may be constructe 


ervation; 


n 
abilities of rejecting H, and Н, whe 
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for the distributi 1 isi 

: he distribution of i, the decisive sample number (p.s.N.) may be illustrated with the hel 

е accompanying diagram, in which j is taken as abscissa and t; as the ordinate The 
aries between the regions have been represented by the lines О'А, and O'R ый 


necessarily straight lines). 


о’ Accept Но А 


Axis of t, 


Continue sampling 


Axis of j 


Fig. 1 

ched at stage j = i, sampling is continued 
le sequences of samples are repre- 
put in one case tm € Apm and in the 
k enters the rejection region at 


" 2-2. Suppose now that although а decision is reachec t 
о a fixed stage j = m (23) In the diagram, two possib 
sented. Each track enters 6 ance region abj =$ 


other case ё, «С, Similar sequences of samples when the trac 


— i are also possible. 


Now а А abilities. 
efine the following proba г before the mth state and 


a decision is reached at, O 


Зан probability that 
mE On, = P {imë Cn): 
(b) p, — bus that а decision is reached at the ith stage (and not before) 
= Pf{t,e0, for j= jar and teAg Pte for je 2-51 
and eR} = pia * Pin (вау). 
(c) pj = probability that a decision js reached at the ith Pr (and not before), but that 
when sampling is continued to the mth stage Ji M ) 
E x шы 1, t,€4 and шт 
-Рфеб, for 7% pc anf ГЕТ ер, and tn€Cm} 
ее 
= pistPin (885). of these probabilities. The continuous path 
ut not with рд; the broken path 


T А meani 
he diagram may help to clarify the “a r and £m 
ed with Pi put not with P, 


w 
Pines beincluded among those associat 
uld be included among those associate 


hen it follows that m + 
ze 


(2-9-1) 


9-2 
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istributi isi le 
hus P, provides a lower limit for the cumulative distribution of the decisive ces 
> = ase the upper bound for p; апа pip we shall obtain an upper limi 
number. 
the cumulative distribution of the p.s.n. 


Let Pills) = lt; |t;€C; for j =1,2,...,i-1) 


i isi ached 
be the conditional probability density function of t;, given that no decision has been reach 
before the ith stage. 


Then 


Further Pia = a т], 8 ры) din) dt;, | 


Pia = | P:lt:)dt; and Pir = | РЕКЕ) dt;. 
x he A tie Ri 


where pi(,,/t;) is the conditional distribution of t,, given &. If we write 


t 
12 


9.2. 
Jim = max | ЕСІГІН ( 
НЕА УтЕСт 


it follows that Pia < nim | Pilti) dt; = 91, p; 4. 
lie Ac 
Similarly if 


2.2.3} 
бы = max | P(t | t;) а. (27, 
кеш JtmeCm 


Pin < SimPin- | 
Hence from (2.2. 1) we have the bounds 


m т-1 т-і (2-2-4) 
Р, < 2 Pi < P. > Nim Pia + X Sim Din á 
= = iz 


The result is valid for an 


y hypothesis H (not only for H, and Н). 
Two less sharp, but som 


etimes more easily calculable, inequalities can be derived from 
(2-2-4). These are 

m т-і1 А 9.9. 5) 
У т< Р,+ У Piim + и MAX Ôim when H, is true, (2 
ізі i=l im 
A nt а 2.2.6) 
2 Di < P, Bmax yy, + P Pi8im when Д, is true (224 
a. т 1- 


We can use (2-2-4) 


of D.S.N. Н7 и and бу, can be evaluated, and, bein 


2-3. Consider now a special case of comparison 


of two simple hypotheses H,; Н, and вир” 
pose that we know that, with an obvious notatio 


n 


pil) = РЕН,). 


А 1 
This is so, for example, when Н, and H, specify two values for the mean of а Norm? 
population, anda = f. 
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For all decisior oa 12е ? 
ecis; 5 і 
ns to a ccept Ho, at sample S] i 


p -әш| H) < erm | Н), 


80 B 
path) < тариб) 


152 
pal) > 122 y (Hy. 


and similarly 
Approxim í 
ximately (i.e. neglecting the amount by hi imi 
ma. з: безе 5 5 thea y which the limits are overshot the inequaliti 
y be replaced by equalities, giving, when 2 = В, ) equalities 
pia) ы pao) = a 
Si pao) pir L=@ 
nce 
pia Ha) + pas H2) = ph) = pH) = pia Ho) + Pirl 
p; 4D) = Pi and pialHo) = (1 бин (233) 


pin) = (1—2): and pao) = Pe 
qualities (2-2-4). 


Hy). 


If (2-3. 
(2-3-1) be assumed to be true we can proceed to use the ine 


Table 1 
pu 
i с 4 с | 4 с 
Е Ligen as н = 7 
1 0-0860 6 өш | 11 00686 
2 0778 1 дё | 5 0686 
3 .0710 8 .0686 | 13 -0686 
4 -0693 9 ias | 5 -0686 
2 ты oo T 
| 
а есетін -— Eni Pu s 
I ДЕГЕ” я cures obtained by quadratu 
n order to see how far this assumption 18 justified we quote fig n yd ure 
istribution of the D.8-- for the case 


in th 
he course of the calculation of the exact 
1 det 


p(z19) = Jen" 
== 00158: 


ә below, to give simple limits 


Th 
| е common value 0-0758 for ® апа 
tion are given in table 1. 


Тһе " j=1 
A alues of the ratio ¢ 7 pas Ho (Ho) OO" 2 statistic over the 
Part from some variation j, when so" E Thus the assumption of constancy 
JOundariegs i a H) РАНО) = 0:0 6. Thus tae" toon 
the r 18 to beexpected. ^ Pinon E he probability of termination for 
-12 1: n = þe Tn di tio 18 rather less than the 
»8,4, is fairly sma! 


Omi 
inal value of a). 


isi m 
m = > т). 
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3. ILLUSTRATIVE EXAMPLES 
1 
A (277) 


Н,:0-0,-0, H:0-0,—1; «= В = 0:055. 


Example 1. Let 


f(5,0) = = е-ін-, 
(i) The continuation region at the mth stage, Cm, is defined by the inequalities 


0025 m Ја, 1) 1—0025 
1=0-025 < 1/0, 0) < 0025 


1т " 
or —1log,39 < E- = [(®;— 1)? —22] < log, 39 
ie. —3:66-- 3m < 1, < 3-66 + im, 
m 
where h.c х жі. 
i 
Similarly (i) —3-66--1m >t, for region А 


т» 


> 
(Ші) 3-664+4m<t, for region В, 


т т 


Denoting the probability density function of tm by p(L;), where in this case 


I 
Plim) = Ттт) exp(— в, — тд)? т}, 
\ 


7:32--1(т-1) 
we have Tim = | 


m Ü Plim) dt nis 
т-і 


+ ilar: 
since this value corresponds to the case when t; is exactly on the boundary of C. Similarly 


‘4(m—i) 
Sim = І Plimni) Фе. 
-Т32-(т-4) 

The following Table 2 
tive distribution of the D 
frequencies in the last col 

Comparing the theore 
x? = 19-0022 with v = 


gives, when Н, is true, the u 
S.N., using (2-2-4) 
umn are taken from ane 


Pproximate to the cumulative 50 
point between the upper and lower bounds is not disproved y 
these results. 
Example 2 
Let a = В = 0-0758. This case has already been referred to in $2. The regions С, Am: Еһ 
are now defined by 


бы —2:50+4m < „< 2-50-14, 
А: —2:50+4m > ls 


Peltm), the conditional probability density function of t 


; А еп 
m БіУеп that no decision had be 


D. 


stage, was computed by quadrature. The distribution 


reached up to and including the mth 
oft 


ml 


рт) = | 


tmeCm 


The results of these calculations 
frequencies taken from th 
kindly put them at my disposal. 


Table 2. Estimation of p; for Exam 


Pelim) (та) dim 


are shown in Table 3, 
e unpublished results of experimen 


ple 1 and comparison wil. 
with N = 2002 
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= та WAS then computed from the formula 


where 2141 = maim 


together with some observed 
ts carried out by Baker, who 


h Baker's empirical results 


| 
5 | Expected . 
Sample Lower Upper NUES Diff. = frequency Observ ed. 
no. $ bound L; bound U; et NO frequency 
1 0-00080 0-00080 000080 0-00080 2 0 
2 :03055 -03080 03068 -02988 60 60 
: 7 11194 -08126 163 140 
3 -10709 -11680 
| 22231 11043 221 217 
4 -20560 .93914 22 e E 
5 -30451 136645 33548 11311 2 2 
5 2 
6 039684 0-48287 0-43985 0-10437 ie m 
7 41944 .58289 -53116 09131 8 183 
8 -55112 -60538 -60825 .07709 is 188 
9 313592 .13289 67320 :06495 
10 “60583 78627 £72605 -05285 106 12 
-6658 -1862 2 
11 0-71173 0-82976 0-77074 (pus S ә 
13 = аа pes 102107 62 63 
13 78164 89726 pen “09455 49 г. 
14 .81545 91855 е 109178 44 43 
15 -84090 .93667 888 
0:01740 35 4l 
1B 0-860174 0-905002 990018 Ше i E 
1 88059 96231 ed 01367 27 37 
T 89761 27865 hem .01045 21 26 
19 -91114 -98000 n “00906 18 a 
20 92334 .98592 95 р 
"T 0.02807 56 54 
21-25 0:93419 — Ея 101730 35 27 
> 25 -96838 — 
| d the exact expected frequencies 
ncies Nf; ал à 
Comparing the approximate expected freq = 8-7304 and д? = 13-4573, respectively, 
: D; with the observed frequencies W° nd that X ‚ between the expected and the observed 
D v — 16, Thus we find that there is nen appro ximate expected frequencies and 
Tequenoies, It is clear from this table ih п. This supports the assumption that 
ү кай жаш б = fequens P ibution the mean of the lower and upper 
cumu ative ОВ 


wi 
E may approximate the 
Ounds, 
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Table 3. Values p; obtained бу (2-2-4) and by quadrature and 
their comparison with empirical results with N = 698 


ы Expected frequencies 
А Арргохітаќе Exact сай 
Sample probabilities probabilties Observe ; 
по. Bi pi д frequency 
р Ур; Npi 

1 0-02410 0-02410 17 17 17 

2 13126 :13355 92 93 100 

3 -16425 -16311 115 114 111 

4 -14608 -14459 102 101 100 

5 11702 11785 82 82 94 

6 0-09112 0-09257 64 65 66 

7 :07080 -07229 49 50 45 

8 :05613 :05698 39 39 37 

9 :04326 -04376 30 31 41 

10 -03489 -03399 24 24 18 

11 0:02796 0-02643 19 18 13 

12 :02258 02052 16 14 12 

13 :01844 :01594 13 11 13 

14 :01526 -01238 11 9 8 

15 :01236 -00963 9 7 7 

16 0-01026 0-00747 1 5 3 

mW -01423 -02604 10 18 13 

Example 3 


Let x be a Normal variate with zero mean and variance o2. 
Ну6-0,-1, Ho =0,=3; а= В = 0:01. 
The test procedure is then based on the following regions: 
m 
Ст: — 10-350 + 2-475m < У 25 < 10-350 + 2-475т, 
үзі 


т 
Am: — 10-350 + 2-475т > У, 22 
і-1 


m 
Е,:10:350--2-415т < У, a3. 
j-1 


In this case we cannot assume that p; ,[p; is approximately constant, but will use one of 
the bounds given by (2-2-5). We shall caleulate the approximate distribution of the D.S-N- 
under the null hypothesis, and will use the formula 


m m-—1 
В, < E pi < Pat X 2: +A max бу. 
is 


i,m 
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We find tha : i 
nat max б, is very nearly equal to 1 for number of combinations of i and m 


im 


and the И Ес ; 
refore we can simplify our calculations by calculating the bounds from the formula 


P m т-1 
т < d Е < P У Piim t 0 
i=l i=l 
where 20-700--2-475(т —D ^ " 
Шал = ч ТКО) dX»-i 
2475(m—i) 


m 
(since Уу 2315 distributed as xar) Я 


ј=і+1 
. distribution, together with the upper 


TI ; Ё 
ne following table gives the approximate D.S.N 
hich the approximate 


and the lower 
dieu low er bounds for the cumulative р.8.Х- distribution from W 
ibution was obtained by averaging. 


Table 4. Esti mation of p; for Example 3 


| 
Sample | Lower | Upper | Меап оЁ the Approximate 
n2 bound bound | bounds probabilities 
" Li Ui | (Lit ЕН) Pi 
= - БЕ cec | E 
| 
5 0.13722 0.13722 0.13722 0.13722 
6 -39140 41729 40434 .30712 
7 56689 01939 .59314 -18880 
8 -69340 15023 -72481 113167 
9 „17997 .34338 181167 -08686 
| 10 0-84448 0-90972 0.37360 0.06193 
| 11 188878 194098 101458 04098 
12 191951 196396 104178 09715 
|” 13 -94342 98146 | .96242 02069 
14 195838 09105 | 101472 01230 
> 15 0-96991 — | 1:00000 0-02528 
бала therefore decision to accept Но 


Th г 
e region A,, does not exis 
hing & decisi! 


is ; 
not possible for these values ofm; hen И 
pability of reac 
js almost negligible. 


pr и 
obability p; contains, in addition to the pro 
atm = 1, 2,3 ог 4 which is а 


probabili : 
ability of reaching а decision t 
(A-S-N-) or A SEQUENTIAL TEST 


Ir 4. BOUNDS FOR THE AVERAGE SAMPLE NUMBER 
we write, abbreviating (2-2-4), 
Tm , 
m < Pi < Pm: 
th ігі 
еп it is easy to show that, for sufficiently large М, = 
a ES 41-28 (41) 
N41- PR < Xn NI- йе 
ities tends to the a.s.N. of the test. 


As Nt i in these inequalit 
ends to infinity th central n thes А re 
n Table 5 the limits f E o д.3. 50 obtained are compared with the limits given by Wald 


fo 
[ © the problems stated in Examples 1 and 2- 
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5 i the 
erimental value 8-8 of the A.s.N. was taken from Baker (1950) and is ы S rdi 
ыг ret 2002 tests. The other experimental value 5-65 is based on the unpublished r 
results of 2002 . 
698 tests carried out by Baker. | : | "P 
А 1 this table we can see that the lower bound, given by (4-1), is sharper than that [em 
by Wald, but that Wald gives the sharper upper bound. Hence, the lower bound in 
кл Wald's upper bound, give narrower limits than either system alone. 


; 0 7а!0% 
Table 5. Comparison of upper and the lower bounds for the А.8.Х. obtained by ч Wa 
formula with those obtained using equation (4-1-1). Case H: 0 = 0; H: 0 = 


Risks ace | а = р = 0-025 | а = f = 0:0758 
| ! 
| ы 
0 і Б Equation ar Exact 
Result obtained by ...| Wald yc. Expt. | Wald b» ges Expt. 
eum MED жәнее 
Lower bound 6:9 7-61 -- 4:20 4:94 — He 
Approx. A.s.N. T2 8-62 8:8 4-94 581 | 565 58 
Upper bound 8-9 | 9-64 6-10 7:20 | — 
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TOURNAMENTS AND PAIRED COMPARISONS* 
By H. A. DAVID 
Virginia Polytechnic Institute and University of Melbourne 


т 1. INTRODUCTION AND SUMMARY 
n an experiment involving paired c stimuli’ such as flavours or colours are 
е іп pans to a panel of x judges. The basic experimental unit is the comparison of two 
i NE a single judge who must state which one he prefers. Such an individual paired 
yn may be regarded as а game between two contestants—the stimuli—while the 
11 the possible it(t— 1) 


experi 5 

o periment as a whole is analogous to & tournament of t players. Ifa 
о j : 
mparisons are made by eac в corresponds to à Round Robin tourna- 


m h of the n judges, thi 
nentrepeated times. Ttw aper to use the language of tournaments 


rat ill be convenient in this p 
лег than that of paired comparisons. 


omparisons # ‘ 


Round Robin tournament, 


The analogy between 2 paired comparison experiment anda 
er well-known way of 
running competitions, the Knock-out tournament. Tt is perhaps surpris 
ments do not appear to have been studied at allin the statistical literature, although this may 
9f all players is desired, it loses much of its force when the aim is to pick the best player. In 
$2 some properties of а simple (1.6. Knock-out tournament are investigated; 
such a tournament, various assumptions about the strengths of the players being made. 
Round Robin tournaments (in effect) have received consi ion both from 
lem is the enumeration of the possible out- 
comes expressed t n А 
59 as scores, of à tournament 0 a gi 
frequencies of these outcomes. Тһе matter has been investigated py Bradley & Terry (1952) 
. Bradley & Terry have also 
Proposed a Ў а 1,8,0 2779 
ed sed supero different model had been suggested 
ity of scores Ma, 
n involving the sum of 
a simple ош Н 
is T ndall & Babington Smith (1940). 
z ith problems of this 
‚о specifie players- wi : 
gens 8 mple Round 


first pointed out by Kendall (1955), suggests the investigation of anoth 
ing that such tourna- 
be due to their inherent lack of balance]. However, while this objection is valid if a ranking 
in particular, an expression is found hich the best player wins 

derable attent 
о peyshome ни = geo i d the determination of the 
and Bradley (1954) and is further considered in $3 L Brat ofthe players ат 
Ln pede y be based on these 
nd Robin tournament the latter 
lad test t questions as what constitutes 
(<t) players іп а 8й 


a tournament is four § БАП 
rength. This result is used to obtain tests of € layer, (ii) the difference in the scores 
E^ per percentage points of 0) the score of 2 8 ecified | т” 
А ош Я 
bwo specified players and (а top score in Ta in ga 
* ome aspects of repeated tournaments are Шо Research. 1.5. Army 
t mes supported, in part, by the Office of Opis (1958). 45, 581-6) published since this 
ее, however, а paper №: Rita J. Maurice 
a , а paper a 
Por was written. ED]. $ ` 
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2. Кхоск-огт TOURNAMENTS 


that the number ¢ of players has been reduced, by preliminary gura 

ks m» а power of 2, say t = 2р, The tournament will then be completed in p темен 

шар are of igual strength, each has probabi lity t-1 = 2-» of winning єз кеде 

ment; ог, more briefly, each has winning probability 2-р, Likewise, if a E ee nit 

beats any one opponent with probability л, he will win the tournament with en a siens 
In more complicated cases the winning probability of a specified player will e 


қ ; s call the 
depend on the initial tournament draw, which we take to be Strictly random. Let us 
t contestants C, C, ..., C, and write 


GO. 13 и BEG), 
to express that C; defeats C; 
bility that C; reaches the rth TO 
P(P is his probability of winni 
meets C; in the rth round an 
tournament. Clearly 


85 
with probability z. Let Р) (r = 1,2,..., p) denote the Lee 
und of the tournament and wins his game in this round, " £C, 
ng the tournament. Finally, let Mf be the probability prot 
d М; be the probability of their meeting in the course 


p (2 1) 
М; = Ў Mfg. 
r=1 


Suppose now that с, 


C, Sat Ons 
Then бүз probability of winning the tournament is given by (2-2) 
PP = Maman? (1— M, gp. 
If the draw is known, М. 


12 and hence P(? сап be ey: 
Shall obtain expressions for М, 


prior probabilities will still be 


aluated. Instead of considering а” n 
2 and P®), the values of M,, and Р) before the draw. Th 
linked by (2-2). We have 

MY = 1/(@— 1). 


are to meet in the Second round, i 


cessful in the first round, so that 


ust 
If C, and С, n which there will be 2t contestants, they m 
both be sue 
М9-(1-ИФтл,(И- 1) 
= ?n,n,/(1— 1). 
Continuing the argument, we find 
И = (2n nya, 
рен Dn 
and hence from (2-1) Мы-у (27, 9) 1 |(t — 1). 
r=1 
It is interesting to note that the М 9 


depend on 
We see also that the probability of a fi 


nal between С. and C, 


(mm) P= = 2M. ic 
ісе 
where C, defeats О, with Probability л,,, the only x ‘the 
Ti; (due to the absence ОЁ ties). It should be noted tha 
Лы > } 


We turn now to the general case 
tion on the 7; being Tg = 1— 
possibility п> 4, Wd 
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is not excluded (compare Kendall & Babington Smith, 1940). This situation can occur when 
he best player as the one 


the scale of playing strength is not linear, in which case we define 6 


with the largest expected score. 'The models of Thurstone and Bradley & Terry assume 
expressions for the лу. Аз we are not 


a linear scale corresponding to the choice of suitable 
concerned with estimation, they need no separate treatment here. 
= 8 and suppose (without loss of generality) that 


ie 2 
To illustrate the general case take / 
Then we have, for example, 


Су, ..., Cy have been drawn in that order. 


ISP тұ, PP Tay PS) = Tan (2-3) 
Pp = Py (mS PP) + та РР) = тутата + Ту Таз)» (2:4) 
(2-5) 


PP = Роль PP mo PE +т PP ems PP). 
Тһе corresponding probabilities before the draw сап be obtained by averaging over all 
possible draws. Let Pf? (iJ, k) denote the joint probability that C, beats C; in the first round 
and the winner of Суг. Cp in the second round; then P®(2, 3, 4) is the expression of (9-4). 
Likewise, we шау (for t = 8) write P of (2:5) more fully as Pf9 (2, 3, 4). Tt follows that 
(2-6) 


Ті» 
EI 
EJ 


M 

- 

\ 
м ® 
t4 

È 


Ж 
үз. 
& 

+ 
n 


y, PPJ, k), (2:7) 


ga 
© 
© 
- 
33 
um 


Ж 
\ 
а. 
14 
4 
x 
© 
= 


i j <k 
РЕК Ее 
d side may become formidable. Thus Pf? is 


For t > 8 the number of terms оп the right-han 
2ў 


th р 

пе mean of 1—2 1-4 E 1-7 terms. 
4-2 À 4 PE 

Example. Іп а tournament of 8 players, let Сіз ‘strength’ in his encounter with C; be 


represented by "ES 
ror variate Vij is №0, 0°), the 56v; 


23 bein 
rength and the er и в 


Where £, is С/в expected st r 
Mosteller, 1951). 


mutually independent (compare 1 


Then [a 
Е, mm Jen 
пз = Р (Ё vj; € SiT 0 m 2 


et" du. 


viateinsamples of 8, 
the strongest. The ё; 
ray of 77,;'s results: 


ы & to be thee: ect ed v e oft he ith lar est unit nor mal de 
хр alu £ С 


во ; ing order 
that the players are arranged in селе For g-1the following ат 


have been tabulated by төсімен (1990). И — 
i | | с с 
с, с; т 8 
€, | € о, Cs Ше M E EE 
TUNE NÉ ыл тн “е. я | | 
с РІС 
А m өті | 05295 | 07579 08475 | 090 0.9074 | 0"9710 
a 072839 pe 064178 | 0.6256 07341 | (ыы 0:8425 | 09425 
с, 01709 | O9 | рама 52 н не 0.7579 | 08982 
с, pres Her pem | $355 "m = 0.6478 | 08291 
с; poen pem pie | 02653 | aum 0:3522 ый 0-7161 
2 | | papi | ке Е ore p^ 
e 0-0114 0-0442 0.0020 | 0.0575 | 01018 | 0-1709 | 02839 
8 0-0022 | 00114 0-0290 | 
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It will be seen that due to the symmetry of the normal distribution 
Пу = 7lg. 5,9. 
i . 2-7) we obtain 
Applying (2-6) and (2 | | 
РФ = 0-906, 0-769, 0-656, 0-551, 0-449, 0-344, 0-931, 0-094: 
i , , 
PO = 0-781, 0494, 0-319, 0-200, 0-117, 0-060, 0-024, 0-004. 
1 > E 


Itisinteresting to compare these probabilities before the draw with those in a draw in vw 
considerable seeding has been used, the first round games being C, v. Cy, C, v. C,, Cv. C 
C; v.C,. From (2-3) to (2-5) we find 

РР = 0-998, 0-956, 0-828, 0:620, 0-380, 0-172, 0-044, 0-002; 

РР = 0913, 0-662, 0-312, 0-064, 0-023, 0-020, 0-006, 0-002; 

Pf = 0-692, 0-216, 0-066, 0-019, 0-005, 0-001, 0-000, 0-000. 


3. Rounp Вовіх TOURNAMENTS 
The results of a Round 


- -wa; 
Robin tournament can be presented fully in the familiar two-way 
table of 1’s and 0’s 


: : has 

mary of the scores, we may write (ajas... a), meaning thereby that C; Ay 

е , ; У , 

Wona, games, C, has won аз games, and so on; clearly a; = 4-1). Ifthe a’s differ yer А 
а real difference between at least some of the players is indicated, Thus a Round Ro 


, impl 
and we may replace (a, ay... а) by the partition [zi 23 ... v], where the 2’s are simply 


r- 
а rearrangement of the a’s in decreasing order of magnitude, and r, is the number of occu 
Tences of the largest score ті, etc.* Tt follows that 


m m 
Ў т =, У та, = 2—1). 
u=1 u=1 

One test of Н, is the ‘circular triad test’ due to Ken 


another has been given by Bradley & Terry (1952) and is based on their model referred 60 
earlier. In both cases the test-criterion is a Symmetric function of the scores, and its nU 
distribution is easily deduced from a, knowledge of the different Permissible partitions 9 
$t(t—1) together with their frequencies of occurrence, Such basie tables have been COP” 
structed by Bradley & Terry (1952), and Bradley (1954) for ¢ $ 5. These authors also phaw 
how results for several complete repetitions may be built up from those for n = 1 and a 
tables cover many such cases. In Table 1 we have extended their results for n = 1 tot € * 
using a score of 1 for a, win and 0 fora loss, rather than 1 fora, Win and 2 for a, loss. A 

A possible approach to the enumeration problem involved is to consider the generating 


dall & Babington Smith (1940); 


function 


a product of (1 — 1) terms 
contains 23479 terms, the exp 
For example, 

G(3) EX (b, * ba) (b, E bs) (by + 6.) — Zb, 2, 


8:2) 
2+ 2b b, b, ( 
* We depart Slightly from standard notation 


in showing zeros explicitly. 
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Table 1. Partitions of scores and their frequencies in a Round Robin 
tournament of t players 


[210] t= 3 t = 6 (cont.) t = 7 (cont.) t = 8 (cont.) 
ts 6 |(9395) 2,640 |[54°21°] 23,520 |[763:21] 134,400 
2 5423220 60,480 | 763323 147,840 
; = | 391 2,400 1533210 13,440 
| Total s | 350 144 | 542321? 60,480 | 753313 4480 
——— | 5423221 206,640 | 752220 4,480 
| Total 32,768 | 54794 28,560 
Qmd 5430 16,800 | 752221: 13,440 
[3210] 94 | 543321 223,440 | 75242210 40,320 
313 f ved 7534213 13,440 
2:0 g [6543210] 5,040 | 543222 183,120 | 75243710 80,640 
2p o4 | 65431" 1,680 | 5351 17,808 | 75243280 161,280 
™ | 654230 1,080 | 53:2? 72,240 
Total 64 | 6542218 5,040 | 4510 1,008 | 75743218 322,560 
653310 1,680 | 41320 16,800 | 7524221 188,160 
= 7523320 94,080 
di 6532920 5,040 | 4831? 16,800 | 7519212 94,080 
[438300] — әд | 6594215 10,080 | 45921 28,560 | 75492221 483,840 
4313 120 | 65321 11,760 | 43330 18,480 
4230 2 6525 1,008 | 493221 183,120 т A pem 
2 39 3323 99,680 , 
ur jp | жем and bs 4 1542970 94,080 
40 | 64313 560 45941 12,240 | 754921 188,160 
32920 6423210 5,040 | 423322 233,520 15423320 483,840 
32912 120 | 6423220 10.080 | 4352 72,240 
321] 240 | 6423212 20,160 | 37 2640 | 75423212 483,840 
Ss 280 | 642291 11.760 _ | 154321 1,653,120 
а Бк Total 2,097152 | 7522 228,480 
T 643320 11,760 754310 134,400 
otal 1,024 | 64331? 11,760 MM SL үт a 
————— (6207081 60,480 = 8 
4 16,800 |, 40,320 754323 1,464,960 
та imd es 1,008 io 0) 13440 | 753° 142,464 
543 1 0] 720 1684230 13,440 | 7592 577,920 
54 : 240 | 63191 16,800 | 26549212 40,320 74°10 8,064 
210 240 | базоз ya aati | 268 13440 | 744320 134,400 
542212 70l 9s o 14б 1653910 > 
53°10 240 fos "560 | 765320 40,320 | 74°31? 134,400 
| 63292 53230 560 16592013 80,640 143221 228,480 
ке 720 | ^^ ЗИ 04,080 | 743310 147,840 
213 1,440 | зова 1,680 | 76535 Soga | 78321 1,464,060 
oe eet | pe 40 | 1657 "440 | 74832 797,440 
595 pe 5242910 5,040 | 764210 13, , 
43910 44 | 524213 1,68 -— өй don 
240 | 5243210 10,080 | 764315 4,480 | 74 З 
24392 90,160 40,320 | 742322 1,868,160 
4313 д1" 43220 , 76423210 ово | 74352 577,920 
то 720 | 524321? 40,320 Dd 161,280 | 75, bdo 
arn 1440 | 524931 23,520 Deren 94,080 | 6843210 13,440 
1% f 293 11,760 
5 eeo | шәр 11,160 | 43220 94,080 | 6431 4,480 
| 2322212 oo dap | 16а 94,080 | 694220 4,480 
ie цаю | din 483,840 | 642517 13,440 
nen 1,680 | 5232 16,00 ТӨРІ 136400 | Өзен 4,480 
2 , 29292) 
22] 8,640 | 542310 in 16370 8,064 | 69372 0 13,440 


4394 
2,400 543220 
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Table 1 (cont.) 4 
= ont. t = 8 (cont.) іш8 (вош) ) 
Же. 8 (с A [6243223] 1,464,960 [esant M ned 
633931 31,360 | 62351 142,464 654322: 230.40 
6325 2,088 | 623422 577,920 65352 1,876,294 
62523210 40,320 | 65°4210 94,080 к 
3 64520 142,464 | 
6252813 13,440 ата 2 46 
| 652412 31,360 64512 142,464 
6252230 13,440 | 65%3210 94,080 641220 577,920 
| 62522212 40,320 | 6523220 188,160 64!321 3,790,080 | 12,230,400 
62542210 80,640 | 6533212 376,320 | 64123 1,025,920 pete 
6254212 26,380 | 652921 219,520 | | 9,461,760 
62543210 161,280 64°31 3,924,480 52 790,800 
65242310 483,840 | 6423222 9,461,760 11,515,080 
62543220 322,560 | 65242220 483,840 | 642342 7,203,840 
62543212 645,120 | 652422]? 967,680 | 643° 725,760 | 571.920 
6254931 376,320 | 6524390 1,653,120 | 53910 8,004 5239 511,920 
6253220 188,160 | 65243212 1,653,120 1 uis 1,576,224 
6253212 188,160 | LE Elbe 7,203,840 
65743271 5,644,800 54310 134,400 А 11,518,080 
62532221 967,680 | 65242: 779,520 | 514220 134,400 | 54132 , 
625391 268,800 | 652310 228,480 | 51421? 268,800 | 3,157,120 
6242310 94,080 | 6523321 3,037,440 | 513220 228,480 | 5432 22 501280 
6242220 94,080 | 6523223 2,486,400 | 5423 9 1120 
6242912 188,160 | 513212 298,480 | 470 725.760 
| 654110 134,400 5 779,520 4531 571,920 
62423220 483,840 | 6549320 1,787,520 | біз ` 107,520 | 4522 s 
62423212 483,840 | 6542312 1,787,520 | 534310 147,840 5 201,280 
6242392] 1,653,120 | 6542221 3,037,440 | 5342320 1,464,960 | 45322 3230,08 
624291 228,480 | 6542330 1,464,960 | 4438 
624310 134,400 5942312 1,464,960 5 456 
65423221 — 14,515,200 | 554292] 2,486,400 | Total 268,435,4 
6243291 1,787,520 | 6542392 7,889,280 | 554330 197,440 


the sum being over 6 terms, This identity may be interpreted as on. 
3 players there are 6 outcomes of type [210] where one player wins both his games, à sec ins 
wins one game and the third wins none; and 2 outcomes of type [13] where each player ber 
one game. G(!) is symmetric in the a’s and may be expanded as in (3-2) as a sum of mono? 
symmetric functions. Table 1 has been constructed essentially by noting that 


tof 
follows. In a tournamen 


G(t4-1) = aw TT 


П (+6) = СЫ + (1) 0153 + (12) bet... + (15), 
(18) = Zb,b, .. b (8 = 1,2, vt). 


JB; Р” 

Three generalizations of (3-1) are perhaps worth pointing out, although they дот 

necessarily provide an easy solution to the problems involved. «on i8 
(а) ИС; and C; play ^; games instead of merely one, the appropriate generating functio 


II (b; + 6)". 
i<j 


where 


(b) ІЕС, EM C; for all i + j then G(t) can be replaced by 


П (бт + бул) 


і<) 


to generate the probabilities of the various partitions, 
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а л Suppose that instead of being two-point (1 and 0) the scoring scale is (r+ 1)-point 
„(= Yl, >>» (47), 9, with the total score in any one game still unity. Then 
П (b;+ probi ЭРЫ We bj) 
і<) 
s may end in either a win, 


enerates ғ si isi 
generates all possible outcomes. This includes the case where game 


а loss or a tie (г = 2). 
3.1. Significance tests 
th thods are available for testing the null hypothesis Н, 
м E all t players in a Round Robin tournament are of equal strength. These tests are exact 
= y for small values oft, but approximations for large t have also been given. However, they 
= overall tests and do not provide an answer to the more detailed questions we may wish to 
ius We consider the following questions: 
(i) What score must а s pecified player reach to be regarded аз significantly better than 
average 2 


As already mentioned, several me 


etween two specified players? 
a significantly high score? 
f scores, on Ho; of s(« t) 


(ii) What constitutes à significant difference in scores b 
d, what constitutes 


(iii) On the basis only of the scores obtaine! 

In order to deal with these problems the joint distribution 0 
players is required. Сув score clearly has the binomial distribution 
in 
Ца) = ( а; 3r 
T o find the joint distribution of d; and а We note that these scores arise if C, defeats С; and 

wins а, — 1 of his other t— 2 games, while С; wins 47 and similarly if О, defeats C;. Thus 
ЖЕЗДЕ) Т 

[e dj a; / 091 

The argument may be extended to give the joint distribution of $ ме 0,0, ---. Ok by 
а consideration of the 25567? outcomes of the 356 — 1) games bate een t 1% Ы; 
Scores in this sub-tournament are аа m their final scores WI i 


Probabili 
mi 1 б е a e^ 
, ‚ ay- бк 


Qj, es Ue with 


fag ay st io 7908 = 5-9 a @ a;— 
РР 
Provided we use the convention 
3:4 
Мі forr <9 and #2" (3:4) 
" 
It follows that ^ 
WE x j^ (85) 
раа Ге” ае aedes а= @ 
ompatible with the 


w ib-tournament с 
here the sum is, in effect, ove? all outcomes of the 8" 


nal scores 
W | m 
" € turn now to questions (i)- ii) above- 
H n . 
Test of the score of а specified player rested ce, on Ho 4138 9 binomial 
ы C, be the player in whose performance we d м ші n ‘ternative that C, is above 
ез s yh, ИЯ easy t0 d the average strength of the 
avera h parameters vd T ent from Те m 
Бе strength (one-sided pinomial t jo, at the 596 level of significa: e 
or ех : Biom. 46 


p players (two-sided pinomial 5681) 
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following are the smallest values of a, which are significantly higher than the expected value 


i(t—1): i а, 
«5  nosignificantly high score possible 
6-8 t—1 
9-1 1-2 


> 11 smallest integer > J[t4- 1-645 J(t — 1)]. 


йа : і ith 
The large-sample result follows from the normal approximation to the binomial, wi 
continuity correction. 


—— ; t 
Next, suppose that the players are not of equal strength, but that it is desired to tes 
whether C,’s average probability of success лу equals $. We have 
ó (a) = (#— 1) 7i; 
t 
vara, = Ymy(1—7,;) < 1(£— 1). 
j-2 
ill be 
Since the variance of a is a maximum under H), we may conclude that the above test will b 
conservative if H, is not true. In fact, as 
vara, < (£—1)7, (1—7) ib 
: ; " 
we can also test the hypothesis that 7, has a specified value по by referring a, to 
binomial distribution with parameters £— 1, т. 
(ii) Test of the equality of scores of two specified players 


We shall find the probability distribution p(d) of d = а,-а,, assuming H to hold. 
From (3-3) 


p(d) = ХЛаь+а,а.) = әнші, а қ ( a ex(t-2) Шы!!! 


"AMOR ive 
where in view of the convention (3-4), the summations тау be taken over all non-negatiV 
integers аҙ. Thus 


20 = экза, 1) (255) d--—(—-1, —(t-2).., (фе), 


is a symmetrical distribution wh 
function of d is 


and hence that d/,/(4t) 
tendency to be rapid. 
At the 5% 


ose variance is J4. It can be shown that the characteris! 
Ф(ш) = (cos 3u)?-4 сови, | 
tends to a unit normal distribution. Numerical work indicates thi 


level of significance, the smallest significant val 


ues of |d| are as follows: 
|4] 


<4  nosignificant values of |d| possible 
5-6 4 
7-10 5 


> 10 smallest integer > 1:96 J(3t) + 9.5, 
While this approach tests equality of the two scores under H, it will also provide a con^ 
servative test of the null hypothesis 


CN 
Ho: Tih = ЛЬ. 


against the alternative 7T, m 
for, in the general case, we find ` 


t 
vard = 47,,(1 =T) ХІ =m) + 7/1 —л›))], 
j- 


which is а maximum under Ah. 
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(ili) Test of the top score 
is The test developed in this section is essentially 
not based on the scores of specified players, but on 
play ers). We obtain, on Ho, the probability of large values of x, from which upper percentage 
pomis may be constructed. Ifin a particular tournament the appropriate upper percentage 
point is attained or exceeded by one or more players, His rejected and, as in the analogous 
problem of testing for ' outliers › these players may be regarded as ‘superior’. 
erus easy to see that for t < 8 the probability distribution of 2 could be built up from 
e 1. We shall employ a different approach which is useful also for larger values of f. 


zu x, is the largest of the scores ay 42 соза, the method of inclusion and exclusion gives 
See, for example, David, 1956) 


t 
Pr (x, > t-r) = Dy ae () Pr (a, 21—702 2 7, Us 
s=1 


alues of 7; the later terms in the sum will be 
illustrate the method on theevalua- 


different from those in (i) and (ii) in thatit 
the score ту of the winning player (or 


> t-r), (3:6) 


w | . 
pss TR any integer from 1 to f. For small v 
ti 0, since only a few players can have à large score. We 
on of the typical term 
Pr(a, > t— 3,0 > 13,43 21797 1—3). 
d by С, C» С, and б. From Table 1 


pn 


we see that the 


Consi 
nsider the sub-tournament playe 


Possible outcomes are | 

930 

55 ; paie 181) [20 

ith respective frequencies | р 
24, 3, % 24. 

in final scores all greater than (— 4. 


Itis, h | 
› however, clear that only [31° and [2217] can result dob ma 
“ty Possible outcomes € Y order of бі to С), and the frequencies 
arise from [313] and |9212), are аз follows: (31) [221°] 
0 
fo #—$, 6-5 1-3 i i 
кыр, R4 deb 8—9 A a, 
1-9, 1—3, 1-9, 1-3 1-4)? 
1—3 1—9, 1-3 ye-2079 iet | 
айда ери onverted into the required 


ch can be с 


In this way we obtain from 


272, W hi 
quation ( 


"This 5; 
Prob Біуев a total frequency of 282 — 1726+ 
bg nr on division by 222-9, the factor 016 


i 
Pr (x, = 1-1) = ga: 


(2,6 


Pr > 1-2) = ga а 


2 
6 
2 apa 179 
Pr(z, > 1-3) = 200-2) 7 995 3 j 
2 ) ; 
(3) Aa) (пе 09) gin 
3/ (gts — 338 + qst- 64) — 217° " 


+ 931—6 
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These three probabilities are as follows for t < 20: 


t Pr (z, > t—1) Pr (z; >4-9) Pr (x, > 4-3) 
3 0-7500 1-0000 1-0000 
4 0-5000 1-0000 1:0000 
5 | 0.3125 0-9766 1-0000 
6 0:1875 0:8418 1:0000 
7 0-1094 0-6447 0-9987 
8 0:0625 0-4526 0:9637 
9 0:0352 0-2989 0-8536 
10 0:0195 0-1891 0-6868 
11 0-0107 0-1161 0-5093 
12 0-0059 0-0696 0.3548 
13 0-0032 0-0410 0-2360 
14 | 0.0017 0-0238 0-1517 
15 | 0-0009 0-0137 0-0950 
I 
| 
16 | 0-0005 | 0-0078 0-0584 
17 0-0003 0-0044 0-0353 
18 | 0-0001 0-0025 0-0211 
19 | 0-0001 0:0014 0:0125 
20 0-0000 0-0008 0:0073 


TNT а 5 vel, 
oken sequence of wins is not significant, at the 5% pee 
until? = 9 in the case of a one-sided test and t = 10 for a two-sided test. For a one-sided 
a score of t— 2 is significantly high for > 13, anda score of (—3 for t > 17. 


4. Discussion 


urnaments it may w 
start with a fresh ra; 
h the largest total nu 
8 of each repetition 
9 not recommend eith. 
ers in order of merit. 


simplest way of doing this is to 
declare as winner the player wit 
Alternatively, the initial draw 
symmetry requirements, We 4 
tournament is to rank all play: 
tournament will be preferable 


require the same (minimum) number of games. Но 
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tournamen S 5 
ts, of S 5 i 
ome sequential procedure whereby the number of contestants is progre 
y ; E 


sively reduced. 


In this r at 
paper att i wW: 
І tention has been concentrated on tournaments ithout repetition of any 


kind. Whe мр 

often асани напа а number of treatments in a paired comparison experiment, we will 
amente елесін a ee possibly by different judges. If repeated Round Robin 
алдып nos and эрек. large, the incomplete block type designs put forward by 
tiderodiin $5, млн ae (198P) are helpful. However, to answer the type of question con- 
ta: deal ilh 5 а izations of tests there given as wellasnew tests are required. Itis hoped 

some of these issues in a future publication. 

some helpful discussions and Dr I. R. Savage 


Miss Betty Laby and Miss G. E. Jacobs 
o of the Virginia Polytechnic Institute 


I : 
ы ас e to thank Dr R. A. Bradley for 
ед з өч. My thanks are also due to 
нн ersity of Melbourne and to Mr A. Roman 
nce with the computations. 
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TDERS 
ESTIMATION OF THE NORMAL POPULATION PARAMBLE 
GIVEN A SINGLY CENSORED SAMPLE 


By J. G. SAW 


University College London 


1. INTRODUCTION 


Е sample 

The problem of estimating the parameters of a normal population given an ordered sam] 
21 < 7... < 2, censored above т, has been treated by several authors. —á efficient 

Gupta (1952) has investigated the maximum likelihood solutions and the r тіс кеміне 
linear unbiassed estimators for д and g, the location and scale parameters. The E нів pane 
equations of the maximum likelihood solution are not immediately soluble. ү make the 
Gupta provides a table which, if enlarged to give finer argument intervals, irr as to the 
solution comparatively rapid. No information, however, seems to be availa = : rtan 
extent to which the resulting estimatesof pand с are biased ; thisislikelyto be o = а those 
in small samples. The estimators proposed below, while based on the same — omnt 
involved in the maximum likelihood equations, namely z,, and the first and secon nd g? are 
of the observations about x,, combine them in such a way that the estimates of д at 
unbiassed, however small the sample. ily han 

The method of most efficient linear unbiased estimators, whilst more casi y de 
computationally, suffers from two defects. In order to obtain the system of ds These 
necessary to invert a matrix of order equal to the number (r) of observations = pling an 
weights differ for every set of values taken by r and n, thus requiring extensive ta weigh 
arduous preliminary computations. Further, and this is a more serious difficulty, pene р. 
are functions of Ф(у), & (y?) and (0:03), where y, < V»... < y, is an ordered sample 


Н < 
Standard normal Population. Аб present these are only tabled for sample sizes ? 
(Sarhan & Greenberg, 1956) 


dled 
ib is 


Gupta has overcome the lab 


are 
et, the 

again limited in the use of them by the fact that, аз У 

ed with sufficient accuracy for n > 20. 


2. ESTIMATORS SYMMETRICAL IN 2, PT 
If we take x E . т-1 
И = 8, 1--(1 —6€)z, with Ша gl (r — 1), 
ізі 
" 7—1 т—1 2 
1 = а Y, (аур Д @— 2) 
i-i i=1 
as estimators of шапа g?, respectively, 
being chosen to make q* 


"m 


ese ёб A 


——Ó— 
a 
Р —— Á— 
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Е з : 
Vae ж has a smaller variance than the corresponding modified estimator due to Gupta. 
ie mptotic efficiency of 7* is unity for all values of "(т--1). 
part from the fact that they occur in the maximum likelihood equ 


r-l 2 
[ У (в: =) is that the values 25, ---»%—1 
1 


ations, the reason for 
considering sucl isti 7.3 2 

g such statistics as 2-1» Y (zx and 

1 


ent SV" ог 7 
ter symmetrically. Thus we may regard ty, +++» Tri as an ordered sample from the popula- 


tion iti i i i 
conditional on 2,, with density function 


> ыу 1 г l/z—puY rdle l a 
I (a | 2) = X exp{-4(3") he КЕЛЕ “ди (-9 <= < т). 


c (27) 
However, for the purposes of this work, since * and 1“ are symmetrical in 21-10-10 ye 
need no longer regard them as ordered. In effect therefore we no longer need the individual 
values of &(y;), 6 (y) and ТАСЫ)! mentioned іп the introduction. 
In a previous paper by the author (1958) it was shown how it was pos 
moments and product moments of 2,1 and a, in terms of a linear sum of integr 


yp Ред 9а, 


sible to obtain 
als of the type 


" EP 1 үр 
ул(р,,т: a,b) = d n [Е (и) 
Е(и) als Z(t) dt and Pr 


d these integrals as a power series in 1/( + 2); tables 
) 0-80. We шау treat moments and product moments 
therefore, 6, % and f may be 
$ the end of the paper as power 
i In addition, it is 
alues for € and var (4) for 
4 у” 
y 
3, EsTIMATION or М 4 7" 


e gi? [1 6.1) (т, сауу, ! 


о = dI,- „= -p)peb 8” 
“т Hn) | с), Е =ч (т, 0) | d М 
ol 4 ;mator for А, then we must have. 


to be an unbi + 
» "y 


where Z(u) - E eo}, = r[(n + 1) 


and also how it was possible to expan 
were given fora +b < 4, p = 0-50(0-05 
E X(x,— 2)? and [E(z; — ®,)] та similar m 
a as power series in 1/(n + 2) though the 
SA 1/(®- 1), since this form 18 thought to conver 
Possible using Sarhan & Greenberg's tables to 8 


l<r<n< 20. 


Writing 


Wi + 
е see that if; = ez, 1 (1 EDLA 
| | є= шы (x 
With this value of € г 0)? 
72 ко)! (jy — io) à 


Шулы 


= Ho) 


уаг(и*) = АЕ Је sizes % < 20. 
м values of e(r, n) and var (и*) are giver in pum 
Or use with sample sizes greater than twenty 
e+ O(n 0 with №7 rint 1) 
e values at 

function of 


p between 


imation to the tru 
hag obtained and will be found to give an excellent werk ae те 
и (correct to the eighth place о: mos view of 2 relationshi 
«(ғ rand n; the complete form e(r, n) b 
>") and e(n—r+1,”) which is give? below. 


e,n) = X Eg) 
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Similarly, we obtain an expansion for the variance of 4% of the form 


ж 
var (4) 
А" 


Р ; аге 
The quantities E;(p,) апа И(р,), which are functions of the ratio р, = r/(n+ 1) andi loda 
given in Table 2 for p, — 0-50 (0-05) 0-80. Values to cover the interval 0-20 « p, « 0:5 

be obtained using the relations 


Г(р,) (n+ 1)-?+ O(n + 1)78. 


5 
rm i=l 


£u Ж пт - 
в(ғ,”) e(n—r+1,n) 


ж 
var le | 
с 


4. COMPARISON OF //* WITH OTHER ESTIMATORS 


n, 


1, var (o) lr m = (И) : 
t; d © nfl —ne(n+1—r,n)/(n—7)} 


қ Wm : f jt. 
Gupta has tabled the large sample variance of й, the maximum likelihood estimator E ү 
Comparing these tables with the leading term И(р,) (n -- 1)-1, we obtain the asymp 
efficiency of 4% for various values of pr 


Asymptotic Asymptotic 

Pr efficiency Pr efficiency 
0-20 0-9485 0-55 0-9711 
0-25 0-9479 0-60 0:9764 
0-30 0-9491 0-65 0-9813 
0-35 0-9519 0:70 0:9857 
0:40 0-9559 0-75 0:9896 
0-45 0-9606 0:80 0-9929 
0-50 0-9658 


iye 
We may also compare the efficiency of 4* and де; (Gupta’s modified estimator), relati 
to Мү, the most efficient linear unbiased estimator. 


Thus for a sample of size ten we have 


о 
——— 
= 1 
N ИИ 


т AIME var ду, 

var иж Var jig 
2 1.0000 1-0000 
3 0-9819 0-9150 
4 0-9756 0-8983 
5 0-9771 0-9113 
6 0-9825 0-9354. 
7 0-9890 0-9663 
8 0-9946 0-9806 
9 0-9985 0-9943 
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It should also be noticed that e = 0 whenever p, = r|(n+ 1) = 4. Thus if we are given the 
dian, we disregard all but the median value in 


at first sight seem а great waste of data, it will 
hat the asymptotic efficiency of the median 
(variance p“) takes the values: 


observations up to and including the me 
forming an estimate of у. Although this will 
be observed from the first table in this section t 
in this context is 96-58 %, whilst the ratio (variance ш)! 


0-9899 when л = 5,т= 3; 0:9843 when % = 7, г = &; 


and 0.9806. when %= 9,7 = 5. 


re available, very little is lost by neglecting all 
ate of 18 concerned. In contrast, the asymp- 
e sample of n is 63-66 %. 


Thus when only b(n + 1) of the observations à 
but the median observation as far as the estim 
totic efficiency of the median relative to the com plet 


5, ESTIMATION or c? 


: r-1 (= Ер) 
Defining Hi; = e а Р p lá 

=1 Е 

r-1 қ i 
veniet ge ned "p (x; — 2) 

іші si 


is unbiassed for о? and of minimum variance Over all possible (a, f) if 
"m ғи” 9H pH Hai) 
a= ur Has - Ho Ho OER ont HO Нұ На Ш 
? Нур эН Ho Hn 


p= (н Hg — Hio Hi (RH T 


and that with these values of 2 and В 


Proceeding as indicated in $2, we find exP 
А -5 
а= X Ap) "ELE , 
ізі 
" -6 
"EX уе 
i-2 
4 -i+ 0(n- D 
в/в) = X MU yox D 
var (7*0?) A ғ 
Values iven in Table 3- i f 62, the maxi- 
of A; Я а W,(p,) ате & f the variance 01 9» 
By com (р, Вір!) ү ba with the large sample form © totic effi ciency of 100 у for all 
trim ШЕ же; AC) к ТЫ has asy™P 
ike timator of 0^; 1 
ihood estimator (rn), Ar, m), var t men 


Values of Е : 
the ratio r/(n + 1): tions connecting 047» "| 
Tt has not been че possi нее nc in auxiliary tables. How- 


a ple to give simple? without bringing" In 
n Y Ы any case, an 
in (n —r 4 1,n), f(n—* pn)» vh ӨКІ! meet most requirement "i : voit 

er, it is hoped that the existing = кто мй more than half the observ 
relia 


esti 
es, nate of variance will be most un 
*nsore, d. 


ry om 
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6. INTERPOLATION IN TABLES 2 AND 3 

Owing to the labour involved in producing the tables, it w 
entry for p.. However, the following method of interpol 
Suppose that n and r are given and that Ф, = r|( 


as not found possible to give a fine 
ation gives satisfactory results. 
n+ l)Jisnotatabulated value. Let U(0:65 


5 г А аѕ ап 
denote, for example, У E;(0-65) (n+ 1)75; then if h = 20r[(n--1)—13, we may use as 
ізі 


approximation to U(p,) the expression 


МА? — 1) (82 — 4) i2 — 9) Бес 


6U(0-55) | 15U (0-60) 200 (0-05) 


720 h+3 ht+2 "hel 777 
18U(0-70) 6U(0-75) "es. 
ы А-1 Аа h—3 


This corresponds to Lagrange 7 


-point interpolation and is reasonably quick to apply. 
Asan example, using this me 


thod with r = 7," — 10, we obtain 
€ = 0-38695262 


» Whereas the true value from Table 1 is 0-3869 5208; 
var (“*/o) = 0-11796547 


» Whereas the true value from Table 1 is 0-1179 6680; 


= 0. be 
тео No check is available on these values but they are thought to 
"4 NM correct to the number of places of decimals given 
var (7*/g?) = 0.3815 
3 


>” = 17, 18 апа 19. Checkin 
found to be 1.1075, 3.10-5 апа 2.1074, respectively. 


7. А WORKED EXAMPLE 
The following example is taken fr 

The table gives the day on w 
inoculated with a uniform cultur 


om Gupta’s paper. 


hich the first seven of ten tested mice died after being 
e of human tuberculosis, 


Day 41 44 46 54 55 58 i^ 

Гов ау = (=) 1-618 164 1668 173829 i549 ле; 1477 

Frequency 1 1 1 1 1 1 : 
Tt is found that 


6 
Ж = 1:6925, a, = 1-778, PLE — 0-062191, [ 
а i 


д* = (0:3870) (1-6925) + (0-6130) (1-778) = 1-745, 
7* = (0-1938) (0-062191) - (0-01767) (0-263169) 

Ап estimate of the variance of д» 
Ап estimate of the variance of 7* 
In passing, 


6 2 
б—)| = 0.263169. 
-1 
'Thus 


— 0-00740. 
is (0-1180)p* 
is (0-3815) ужа 


= 0:873 x 10-3. 
= 0-209 x 10-4, 
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Ty REN 
2 n е. Similarly, for a rough test of the hypothesis 7 = 0% we could refer 7* to the 
"n "t ype III distribution with the appropriate mean and standard deviation. These 
problems have yet to be fully investigated and will probably form the subject of a later 


paper. 


Iwi " Р қ 
wish to express my gratitude to Prof. E. S. Pearson, whose suggestions are responsible for 


8 Р 
everal improvements оп the original paper. 
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I V 1 E P 20 > 
able 1 alues of the weight в and the variance of ис for 1 < т < % & 1 
| Weight, € var д®/с Weight є | var p*/o 
| gnt, н 
к б .917+ 597 
n= 3r= 2 Zero 0-4486 7110 | n = 11, ~ = — 2.0245 4630 1:2178 65 


5 
A бағы .4573 1320 
3| —1:2238 3557 0-4573 7 : 

| Е 4 —0-69600094 | -2569 4929 

| 5 
9 


| 30705 

poe T -1763 072 

n= 4,r= 2| —040555159 | 0:5129 9280 — -3060 1488 | 

3 -3093 2088 | -2882 4812 қ c — 8548 
= 7, 024925991 | -1160 iet 
291 8 4572 0905 | poe oe 
n= r= 2| —0-74110682 | 0-6111 918 я “63322013 +007 m 
4 «аша | sais "032 ui iod ом 
= | = — ee жа 43 

n= 6r- — 1.0260 6712 | 01864005 |" = 127 = —2.1728 1803 | 1:3044 1 


" 

2 359 382 
2 3| —136780736 | 0-4850 3821 
3 | —0:2676 7985 | -3041 5115 4 -083565351 | 27205514 
4| (22269148 | -20895708 БЇ ЕНЕ | актов 
5 :5735 0999 | -1774 4443 


5 6 | —0-13240995 | 0:1398 = 
7 112057511 | “1158 нт 
n= ",r- 2| —1-27331715 | 0-8263 6293 8 .3395 5903 | -1018 jd 
З | —0-50238193 | -3302 2028 9 -51314930 | -0933 eh 
4| Zero :2104 4686 10 :6685 3886 -0881 254 
б | :3642 6439 | -1674 8290 T 
11 0-8324 6899 | 0-0847 288 
6 | 064223186 | 0-1498 1938 
Е 
n = 13,7 = 2| — 231007581 | 1:3868 ert 
n= 8,т = 2| —149153218 | 0-9309 9884 з | —1-5009 4676 | 05208 4250 
3| —0-7109 1496 | -3604 2893 4| —0-96540881 | -2874 m 
4| — 11997 0421 | -21807919 5| — -56867675 | -1914 068 
5 "1737 4521 | -1646 3021 Я 
| 6| -02560 4852 | 0.1434 7507 
6 | 046239456 | 0-1409 3561 1 Zero .1167 Ши 
7 :6920 0548 | -12981932 M 2152 0920 | -1009 0906 
== 9 :3993 2369 | 0910 а 
10 "55874621 | “084759 
n= 9,r= 2 | —168675767 | 1:03129527 06 
3 | — 0:8983 5837 | 0-3924 5050 11 0-6977 1536 | 0-0807 o 
4 | — -38028184 | -2292 5388 12 81888582 | -0782 83 
5 Zero | -16610128 | -— 
6| 029005268 | 0-1368 8836 | „ _ _ m .4653 7120 
7 :53446005 | -1221 7529 ка : Z ges m emis en 
8 772970989 | -1146 2569 4| —1-0865 6888 | -3029 p 
3 — (€ E 5 | —068715159 | -199840? 
n-—I0r- 


2 | —1-8633 5148 | 131269 0294 8 | -pangan | 05478 oe 
3 —1:0685 0751 | 0-4250 4749 7| — -1133 1319 | +1186 0683 
4| — 0-5448 8270 | -2425 0203 8 110455479 | -1009 38 
5 -1703 1365 9 

0 


| — 1592 4349 | 9915 1408 | -0897 a 
| | 1 4540 6085 | -0825 04 

6 | 014237402 | 0-1359 7144 | 98 

7| -38695268 | -11796680 11| 0459062306 | 0:0777 19% 

8 | -58963408 | -10808017 12 1999 2478 | -0745 62 

9 75925335 | -1026 6399 


09 
13 “8327 4358 | -0725 58 


J. С. Saw 157 


Table 1 (cont.) 


л #& wre 


5 
= 1:59 
— 1.2001 9350 
— 0:7984 1909 


— 0-4509 6831 
— +2201 2308 
Zero 
-1893 1561 
+3543 9077 


0-4997 5262 
+6285 8591 
:7431 0067 
:8446 4790 


— 2.6696 1409 
— 1:8506 3947 


— 1:3071 6059 | 


— 0-9032 9419 


— 0:5838 7075 
— +3210 9497 
— -0990 2045 
-0922 8706 
-2594 7023 


0-4071 0746 
-5384 7038 
+6559 1509 

-7610 9259 


-8549 8345 


— 2.7754 4828 
— 1:9538 0346 
— 1-4082 0482 
— 1-0024 6439 


— 0.6812 3676 
— -4168 0820 
— -1930 4540 
Zero 
-1689 7949 


0-3185 2094 
-4519 4855 
:5716 9627 
:6795 4769 
-1161 5745 


0-3640 4026 


— 9.8155 5681 
— 9.0514 7109 
— 1:5039 4667 
— 1:0965 1402 


105633 | 


| var pelo Weight, € «ағ pelo 
| d 1805 | п = 18,7 = 6 | —0-7737 6245 0-1692 4581 
-5162 8847 7 | — -80776661 | “13080404 
3184 5172 в | — -28250321 | “1064 6866 
. -2085 8619 9| — -08794308 | :09103752 
10 .08259311 | :08028344 
| 0-1527 6888 
11910 5385 ул | 023376240 | 0:0727 5085 
-1016 9465 12 .36892384 | 0673 7399 
-0892 0543 13 .4905 5815 | 06348909 
-0810 9450 14| 60051947 | “0606 6704 
| 15 0018374 | “05862267 
| 0-0755 6299 
| .0718 0372 16| 079051758 | 0:0571 6368 
.0692 6842 11| 87204112 -0561 6310 
22106761792 Pen 
аі % 299705 2895 | 1:8100 4988 
1.6121 0956 3| —214419951 0.6910 0073 
0.6097 5853 4| — 15949 1464 13780 2346 
3337 6521 5|- 1.1859 4337 | 2457027 
551779 
иле в | — 0-8617 6868 0:1750 4878 
Без т | — 59440913 11345 4868 
| қ м veri g | — 3678 0590 +1091 3104 
12; g | — 11190350 -0923 2730 
| 10304160 7 0807 9098 
| 108043219 10 ero 
0803 4191 11| 01525 8385 0-0726 4730 
2 “28099182 | “0667 8006 
(е ме Та 711245526 | 0024 9205 
0697 8395 14 .5241 5083 10593 3084 
0667 6856 МЕК | “0569 9385 
0646 9077 
0633 1093 16 0:7183 5714 0-0552 7424 
| 17 80269911 | 0540 2945 
| 18| 87015096 | 0531 6423 
1-6808 1527 gus 
0-6376 708 БЕ аа | 
3488 6837 "TS in 3.0608 6752 | 18709 1592 
2965 2675 | ^ 7 ^" = 3| -22324 6540 | 07164 7801 
4| —16815 6241 .3925 8856 
0.1635 5464 g| -1211 g372 | “25351721 
| .1972 575 | 
| 210474082 | —0:9457 1364 0-1809 1765 
| 0900 4658 7 | — 91111928 -1384 3893 
0301 0670 g| = 4893 0972 | -1116 5405 
| 9| — 2522 0650 | -0938 5532 
0:0732 1078 10| = 090 3873 | -0815 6600 
0633 4595 | 
0648 8380 п | 0:0747 4052 | 0-0728 3429 
0624 2044 12 "2126 8352 | ‘0684 9576 
| 40606 9067 13 .3372 1331 .0618 2136 
14 45043978 | ‘0588 3664 
| 0.0595 2219 15 55264589 | `0557 2300 
16| 00480 4129 | 0.0537 6121 
1.1467 2583 i7| 7844 волт | -05229780 
0.6647 4230 18 | .8135 5787 -0512 2505 
3637 2129 19 .3860 9203 | 0504 6728 
> SL Le 
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Table 2. Coefficients of (n+1)-* in the series approximation to (a) the weight factor 
and (b) the variance, of the estimator of location, и* 


(а) є = X Ep) (n+1)-§+O(n4+1)-* with p, = r/(n+1) 
i=0 


| | А jx (рг) 
р Бур) Бүр) Bap) | Вар) Edp) | Бір 
- — ——— 
0-50 Zero Zero Zero Zero 2016 Е. 
0:55 0-14865772 0-0783 132 0-006281 —0:04107 - 0:0201 161 
0.60 | “28935012 1370787 | 001435 | — -07785 | — -0248 
| | 0273 
0-65 0-40340719 01818639 | —0-012337 — 0-11338 — 0-0159 y 
0-70 :51356217 -2138 610 — -034313 — -15044 :0100 63 
0-75 "61418208 -2352568 — -065129 — ‘19253 :0646 438 
0-80 ‘70631072 | -2468760 — -107286 — -24566 :1784 1 
| — 
" "ТЫ 5 - e” 
(b) var =) = Vy) (п+1)-1+0(т+1) 
i-1 
кенес 
Pr Ур) Ур) LACS) Vi(p,) Volpe) 
0-50 1-5707 963 0-896605 0-0987 3 — 0-4265 = 0-132 
0-55 1-4169573 716520 -07237 — -1662 "275 
0:60 13034798 -633049 17656 :3218 :970 
0:65 1:2186755 0-605297 0-32705 0:5913 1-481 
0-70 11547730 -610085 48554 9072 1-840 
0-75 1:1064195 :634096 -63859 1-1676 2.090 
0-80 1-0698709 :669408 18204 1-4575 2.333 


Table 3. Coefficients of (n 1)-? in the series appr 
used in estimating аз 
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oximation to the weights ® and В 
and for the variance of this estimator 


| = 
Pr A (Pr) A a(Pr) | А.(р? Asp) " 
mu pe | 
0:50 2-482906 5:61883 6:8137 16-886 
0-55 2.213248 4-34260 4-3813 10-188 
0-60 1-989519 3.40329 9.7795 6:215 
0-65 1-800852 2.69428 1-7316 3.852 
0-70 1.639499 9.14687 0:9989 2.226 
0:75 1-4997 74 1-71526 0-4764 1:276 
0:80 1-317385 1:36769 0:0451 0:515 
EMT акын 
Pr Bp) Вз(р,) В.р) Bs(Pr) 
Ж MEME —— 
0-50 — 1.517094 — 9.10572 — 11-8257 - Pu 
0-55 —1:603473 — 785637 — 89535 - M 
0-60 — 1:6046 46 _ 642606 — 61106 - 
0:65 | -І-560119 2520056 | -Z pum Е p 
0:70 — 1-491833 — 440321 = pero Au 
0-75 | —1412189 74617797 |Z un Z 3438 
- Ee _ 
| Wo 
Pr (р) Walp) Walp) к 
Кы mue И 
61363 32.610 
0-50 4-965812 9.05840 ud 2441 20.516 
0-55 4.426496 ны M BU 
0:60 3.979038 5.66638 М 
5.6512 8 
0-65 3-601709 455406 4.0015 5:446 
0:70 3-218998 cmn 2.8306 3:585 
0-75 9.999548 3-001 1.8451 1-902 
Е 9.754769 
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ON HOTELLING'S GENERALIZATION OF 7? 


By К. C. SREEDHARAN PILLAI} anp PABLO SAMSON, Ун 
The Statistical Center, University of the Philippines 


l. INTRODUCTION 


А ЖИР 71241008 
Let S, and S, denote the matrices consisting of the sum of squares and products of dev co 
from means for two independent samples of sizes n, and ns, respectively, taken fro 
p-variate normal populations. Then T$ (Hotelling, 1951) may be defined by 


ТЕ (т, —1) = trace (Sz! S). 


+ . . . . . B 2 S its 
It is a generalization of his 7? Statistic. Since the trace of a matrix equals the sum of 
characteristic roots, Т? 


0/02 — 1) is the sum of the characteristic roots of the matrix Sive 
Define (79 as the sum of the characteristic roots of S218, where s is the number of non-zero 
roots. Thus s = pif n,n, > p. It may be noted that 0 < U™ < оо. А 
We give here expressions for the moments of U® fors = 2.3, .... 6. Upper 5 and 1% points 
of the distribution of U® have been computed for s = 2,3 and 4 for various values of a 
supplementary parameters using the exact moment quotients of U'9, Comparisons of thes 
values fors = 2 are made with those obtained from the exact distribution, and those from ha 
approximation by Pillai (1954). For s = 3 and 4, comparisons are made only inh 
approximation. A numerical example is also given to illustrate the use of the criterion U^^ 


2. THE MOMENTS or [76 1 
By making use of а recurrence relation given by Pillai (1954, 1956) the first four centr? 
moments of U for s = 2 3,4, 


5 and 6 have been obtained. We shall 


= 2, 


write 
т-(т-ө-2) n= т —s— 2). ны 
The nature of the first three moments suggests the following general forms: 


mu = sim: i (s +1)} " 
т 


(2:2) 
ир = те D) Que m M 1) (оп а) (2-3) 
n?(n — 1) (2n 4- 1) D 
up те Dome go D) (n+ 2m +s+ 1) (2% + 25) (2n 4-5) (2% 
n*(—1)(n—2) (2n £2) ( 2 s 


2n+1) 
Although the fourth raw moment of U'? has also been generalized (Pillai & Mijares, 1809), 
the central moment has not yet been obtained because of the heavy algebraic computation 


NT 
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involv " 
volved. The fourth central moments of UY fors = 2,3 and 4 which are used for computa- 


tion i 4 4 
n in the following sections аге therefore given in their explicit forms: 


“2? Аз 
where n*(n— 1) (n—2) (n—3) (2n 1)(2n— D (2:5) 
PEE “ 
a2 = 3(2m + 3) (2n + 2m + 3) {[4n? + (2m + 3) (2n-- 2m 3) (n+3)] (п-1)(а- 2) 
+16n(n+ 2m + 3%, 
Аз т” 


T TR | 
“Неге dis (n+ 1) пт 1)(n—2) (n — 3) 2n + 1) Qn— 1)? 
9)[—2n*+ (m+ 2) (n+ m+ 2) (1200 ау 


Аз = 9(m 4-2) (n m 2) (n 1)(n— 
Y [8(n- 4j — 38(n-- 2 — 240 4) 134], 


— 88(n 4- 4) + 125] +(®+ am+4 

nu = = SI EN —— À eet (2:7) 
(n+ 1) n(n—1) (n—2)(1— 3) n +3) (22+ 1)(2n- 1) 

where 

an? + (2m 4 9) (2n 2m + 5) (n 5p 

4 (n4 2m 5)? [2m 5) 

90(0+5)+ 110i 8)— 224]. 


Ay = 24(2m + 5) (2n + Әт--5) {(n+ 1)(n— 2) 


— 12(n 4-5) 4- 44(n-* 5)—49}] 
—17(n+5)+ 
Hence, i(n +5) 
(9) — 4(n + 2m4- s 1) (n 25) (0D (2n #1) ен (2-8) 
DÀ sim + (8+1 ) {n +m +8 L 1)} (2n +5) (п- 2y (2n+ 2)? 


(2) — (n — 1) Qn 1) Aas за - (2-9) 
= (2m + 38)? 2n + Im + 3* (w+ 1)? (n-2)(— 3) (2n- 1) 
(2-10) 


(з) _ (n — 1) Qn P NE СЕ 
BP = mpap (w+ m+ 2) Ont Sy Qn - 1) e D (720-9 
j 1) А MM ( 
and a (n 1) nt ^8 8—5 
и 4(2m + 5)? (2% + 2m + 5) (n 2? (20+ 3) n- D + lans 
ined 
th It may be observed that the moments of yi can be un уот А). In the numera- 
sme of V9, another criterion which is defined аз the = т : a highest wat of n positive 
of any moment of V9, if one maintains the coefficient 0^ © : 
А в of n, then substitute 
Гы put alternatively positive and negative signs for subsequen ратын ші moment of Ue, 
B = the value m+n +s+ 1, one obtains the numerator of the cor p кейі 23 ene 
ау the denominator of any momen S airo volving n and negative coeffi- 
Pe ent of V9 by putting positive сое e £m ns Horn. 
ts for other terms in each linear facto 


2-11) 


in a simple manner from 


ION OF Uo 
B. Dus DREES PERCENTAGE З 2-9) for parameters 
s of n and smaller 


The 
mom i 2 n compu 
ent quotients, JY and 29, have bee О (except for a fe Жа сонне 
“Та 


т, 
- 1(0- 5) 5(5 ко: 60, 8 
30: Bp and ЛЕН МА ы ro 
alues of )30; . h evaluate К & 
m). The u Бапа 19 oints ате e 3 rdized measure ped 
pper 7o P expressed in standa Biom. 46 


Poi 

nts 

е of Pearson curves for given P1 and ра 
I 


хт OF 
ч 2-8) and ( 
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E and 1 of this 
ley, 1954, Table 42) and are shown in Tables 4 and 5 printed at the enc 
Hartley, 1954, 2 
; е 
ш” i х р.ғ. of U in terms of incomplet 
Hotelling (1931, 1951) has obtained the exact с.р.ғ. 
f.-functions and it is given below: 


* " 4 i 1-ш n4l 
(2n 4- 2m 4- 4)! Еге 


А 
Pedum ps melee ir^ 


31 
Вия (m+1,n+2), (8:1) 
where w = ГФ/(2-- U), 


тылы үр ud тө) in the form 
Pillai (1954, 1956) has suggested an approximation to the distribution of U“ in 


[79а (3:2) 
> Ue) = (ГӘ) sn-cm4-3(6541) 1 
B(s[m- Ms 1)], sn + 1) ste ( + =) 
and the approximate с.р.ғ. for s = 2 can be shown to be (8-3) 
d P =1,,(2m+3, 2n+41). 


A comparison of percentage 
and approximate C.D.r.'s can 
Pillai’s observation ( 
negligible for larger 
distribution could b 
larger, say around 4 
the exact 0.0.7. оп] 
closer to the exa 
observations just 


-act 
points from the moment quotients with those from а 
be judged from Table 1. A study of these tables re quite 
1954) that the error in using his approximate a append ate 
values of n; for example, for the upper 5% points, the s little 
e used when т is 30 or more; for the upper 1% points, when % en б 
0. The percentage points from the moment quotients differ i: jatively 
y in the third significant digits for large values of n, and appear re These 
ct percentage points than the approximate percentage points. + 


s e of the 
ify the use of percentage points from the moment quotients or the u$ 
percentage points from the approximate с.р.ғ. for larger values of n. 


А (2) 
Table 1. Comparison of approximations to the upper percentage points of U' 


5% points 1% points 
a a From From From From From From 
exact approx. exact approx. Ba Ps 
C.D.F. C.D.F. Ba В, C.D.F. C.D.F. 
0 5 1-43 1-39 — 2.17 2-05 a 
0 10 0-671 0-664 — 0-979 0-931 AA 
4 10 1-927 1-886 = 2-516 2-415 a 
0 15 0-443 0-435 — 0-624 0-602 29 
4 15 1-236 1-219 = 1:563 1:526 zu 
0 20 0-328 0-323 — 0-455 0-443 З 
3 20 0-7690 0-7600 0-7683 0-9698 0-9499 00109 
35 | 20 0-839 0-829 0-842 1-050 1-029 1-088 
й 20 0-908 0-898 0-907 1-130 1-108 ju 
5 20 1-046 1-034 1-045 1-288 1-263 bese 
0 30 0-2155 0-2124 — 0-2960 0-2904 598 
3 30 05020 | Odass | бгз | 02960 06165 | 0:623 


№. 


K. С. SREEDHARAN PILLAI AND PABLO SAMSON, JR 163 
4. UPPER PERCENTAGE POINTS OF THE DISTRIBUTION OF UC?) FOR CASES 8 = 3 AND 4 
The moment quotients, Д, and дз have been computed for various values of the parameters 
m and п using (2-8), (2-10) and (2-1 1)fors = 3and 4. The percentage points were then again 
evaluated from Table 42 of Pearson & Hartley (1954) and are shown in Tables 6-9. 
Making use of Pillai's approximate C.D.F. given by expression (3-2), we have for s = 8, 


p = L(3n-46,3n 1, where w= Uey(34- 09) (4:1) 


and fors = 4 
(4-2) 


| 
р = 1,(4т+ 10, 4n+1), where w= UM)|(4+0®). 

For comparison with the upper 5 and1% points from the moment quotients, percentage 
points were also evaluated for certain values of m and n from the approximate C.D.F. given 
in (4-1) and (4-2). The results are tabulated together with the corresponding percentage 
points from Д, and Ёз in ТаЫез 2 and 3. 

м ; 3) 
Table 2. Comparison of approximations to the upper percentage points of U* 
5% points 1% points 
т = 5 
т = 0 т = 5 
! т 
и ^ From F From 
s From From: 2 rom | approx. 
From From á Ж approx. 8. В, 
гох. prox: „га C.D.F. 
By Ва pom By Ва с.р.Е. оо 
E 2.560 2-462 
15 = == 9.120 2.076 E 1:832 1-798 
20 мегі == 1:557 1.533 E 1-449 1-415 
ойға | 1490 1-168 
5 0:9943 
0:3867 | 1009 
+3376 0.8759 E 
40 +2692 9671 1590 .9995 1186 1680 
ed ән) 4m] 5| d je. | se 39 
0 4 к .5977 5 
9144 2133 597 o2227 | 05728 0:5693 
60 0-4936 Ms | 6: 
0-177 -1774 0-4959 1 А 
80 ава СОЕ IN ae қа) dé | МУ 
1 


+2947 


100 -1062 1060 2949 


oin 
Te Study of these tables shows that the pere b S? tributions ap 
larger values of n and hence the appro " ; 


5% 


11-2 


25 = = 1-229 1:213 

30 0.3621 | 03575 | 1018 1-005 

35 0.3097 | 0:3055 | 08638 0-8571 
483 


164 On Hotelling’s generalization of T? 
Table 3. Comparison of approximations to the upper percentage points of U® =. 
- 48 
5% points 19, points 
E Р mu uM | uu 
m=0 m= 5 m= 6 m = 5 
n = 2 ааа ы 
г тө From 
From MA From | Ed From UND From | approx: 
X. OX. Хх. 
By Bo Ep Ay, В» Cm By Ё. | in By fs C.D.F- 
L | o NIE pee 
15 1-122 1-084 2-877 | 2-812 1-413 1:334 3:395 3:246 
20 | 0.8272 | 0-808 2-113 2-085 1-028 | 09841 | 2458 Me 
25 -6548 -6415 1:668 1-611 0-8069 "1800 1:925 dr 
30 84422 -5342 1:378 1:373 -6644 | -6478 1-581 1-08 
| 
35 0-4629 | 0.4572 1-174 1-172 0-5646 | 0-5530 1.341 120 
40 4033 :3994 | 1.023 1-014 4911 4819 | 1165 ce 
45 :3574 (8544 | 0-9058 0-9035 4343 4264 | 1:029 жс 
50 "3209 -3184 :8130 :8095 :3893 :3821 0-9218 0-91 
60 0.266060 | 0-2651 0-6745 0-6738 0-3924 03180 | 0.7697 e 
80 :1990 :1984 :5030 :5039 "2399 | -2375 -5670 Kk 
100 :1588 -1583 4010 4018 1909 | -1895 4512 “4b 
= - nuege 


5. AN EXAMPLE i 
The criterion U is useful for testing different kinds of hypotheses in multivariate analy pn 
one being the comparison of two independent estimates of the same dispersion matrix: 
A second problem in which this criterion can be used is in testing the equality of р el 
sional mean vectors of J p-variate normal populations (which is mathematically identi 
with the general problem of multivariate analysis of variance of means). The values of man 
п appropriate for this test are 
( 5:1) 


^-ii-»-1|-à »=4W-I-p-1), 263) 


where М is the total of I sample sizes. The following data are taken from Rao (1952, P 


ч ren. 
he same age belonging to six differ 


trices consisting of the sums 04840 
е three characters. Then 

1 2 3 
752-0 214-2 52153) 1 


S* = 15-3 401-2) 9 i 
1612-7] 3 
1 2 3 
а 1003-7 2671-2) | 5:3) 
[m 1499-6 — 4123-6] ә. " 
3 


21,009-6 
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The value of U® = 0:2 А 3 

та | 201 6 is obtained as the trace of the matrix S7! * = 

that iie ie i ecd HUIUS. Referring to Tables 6 and 7, with EG кез ue s parer 

desila aan signis ag just significant at the 5% level, which falls at about 0-195 and i 

Pillai’s же iar 5 the 1%, level, 0-242. The corresponding percentage points мз 

af Rew, who ae c.d.f. are 0:194 and 0-239, respectively. This agrees with the findings 

(1957), ы, pe the data using the А criterion of Pearson & Wilks (1933). Foster 

| ШШ тыы E e : that the largest root is significant only at the 15 % level. 

diu different тов И par d example different test functions based on the same roots 

ыға ааа eed not be regarded as surprising. Кат too little is known at present 
ves to the null hypothesis to which these different tests are most sensitive. 


The auth Р 
authors w. ы ami 
s wish to acknowledge the facilities offered by the Statistical Center, University 


of the Philippi 
ilippines, during the course of preparation of this paper. 
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Table 4. Upper 5% points of the sum of two non-null roots, UM 


Тор 20 25 30 35 40 45 so | во 80 100 
E | 
| 0-092 
10 +106 
їз 120 
20 -133 
6 
3-0 - 
28 0-159 
зз 172 
45 d» 
t5 197 
10 o 
10 240 
15 «558 
2 074 
25 790 
50 = 
= 
^| 15 90 25 30 5 nii 
a 35 40 45 50 60 80 
— ss 
г él 118 
а | ome | ee | 
sh = : — . 0-304 | 0-273 | 0-226 16 
| = | = (es m җа) зы | ap | зш | шй | 1s? | des 
SM 4 EX 85 | -422 | .372 | -334 27 205 | | 
0970 | -760 | -624 | 4500 | .as8 | 408 | зв | Gol | 224 | 178 
7 = 1059 | 0822 | 0675 | 0-572 | 0-497 | 0-439 | 0303 | 0:325 | 0242 | 9 ir 
45| — | 09101 586 | 225 | -016 | би) 472 | 499 | gag | -250 | 70 
ІС |10 5| 6 | -658 | -571 | 54(| 451 | 378 | -277 | 244 
1005 | 825 | -700 | -607 | 586 | 3480 | 306 | 3208 | 5 
10 | — | 2057 | 1602 | 1-310 09 
№ . 1108 | 0-960 | 0-84 "15 ў ) 4% 
i | amie | Bea | stee | ao | rama | ogm | osae | ozor | om IE IE: 
ш ime ee Toi | tias | ras ЕТЕ ТЕ 
р Г i 5 17 4 * X 3 
30 | 7-092 | 5.043 | 3900 | 32173 | 2673 | 2.306 | 5056 1807 jane 1007 м” 
50 |1128 | 75996 | 6167 | 5.007 | 4202 | 3.626 | 3.185 2.837 | 2328 | 1-711 gu 
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Е т à 
_ Table 6. Upper 5% points of the sum of three non-null roots, U® 
n м | 
aX) 5 20 25 | 30 в 40 45 50 60 so | 100 
| | | | 
$6 | 0133 | 0-106 
08 258 | -126 
ть “183 | -146 
15 907 | -165 
"M 331 | -184 
3m 0:254 | 0-203 
р 1978 | '929 
va 993 | БЕРДІ) 
vit .394 | 958 
tg ‘зат | 211 
Bi ‘369 | 295 
10 0:593 | 0472 
15 зи | -646 
20 1.096 | “817 
25 1.196 | 953 
30 1406 | 1120 
1671 | 1326 


3 


| 


Cue WH OD 
Флоаоа SaaS 


Table 7. Upper 1 % point 


um 20 25 30 
кш — — 0-464 
— | 0.837 | 0658 | 54l 
re +956 +750 617 
Tus 1:071 +840 601 
634 1:186 -930 765 
1-791 | 1208 | 1:018 | 0-837 
por | 1410 | 1106 909 
2102 | 1521 | 1199 -980 
еа 1.278 | 1:050 
Аа ЫС 1.364 | 1120 
560 | 1-852 | 1-449 | 1190 

4-060 | 2-925 

: 2.983 | 1871 
п | 40% 3.101 | 2:536 
TS 5.030 | 3:938 | 3196 
e | eee 4-419 | 3:018 
12 | 6639 | 5184 | 4244 
11.9 | 8318 | 6432 | 5237 


-null roots, Uo 


s of the sum of three non 
35 40 45 50 60 80 100 
о 
| а еке ЕЕ 
а 
р 0:225 0:167 0:110 
: өзі | 0308 oan | 0225 | 0105 | 186 
0-394 ‘353 | X! A 593 | 2177 
459 400 ^40 361 -299 : 
i Zog | c9 | 2198 
.594 = .451 404 3 35 n 
.587 a 499 447 +370 
d À D „9. 
оли "us 9% | 5% p .339 | 279 
Th | 89 | s Ba | 5| Ste | 20 
э | m | m 513 Big | l| 1319 
-891 £1 i 4 
950 | $% T s өзі 4n | 399 
009 | ^" ы 
i |) ou | ae | 088 | | 
1:583 1:372 1.639 1:465 S Ew 16 
270 1338 ae 185 Газ | 1285 1.021 
‘лот | 2 | 5 ; i 
2206 2058 248 24 | 2029 |! 505 | 1196 
` Я . 428 
pe өз | 534 gogo | 2451 1805 | 1 
4400 | 38 
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) 
Table 8. Upper 5 % points of the sum of four non-null roots, О“ 


100 
10 15 20 | 25 30 35 40 45 50 60 so, 
; .167 | 0133 
— | 0-712 | 0-547 | 0-453 | 0-387 | 0-337 | 0-299 | 0-268 0-223 йи 159 
06 — | 1122 | -827| :655 | -542 463 | -403 357 | 391 | -267 mer | «194 
Ge — | 1302 | -961 | -761 | -630 | -537 | -468 | -415 372 | +309 281 | .209 
| 10| — | 1-482 | 1-092 | -865| -716 | -610 | -531 | -471 426 351 ЕЕ! 
| 15| — | 1-660 | L223 | .968 | .800 | i682 -594 526 | 4472 | -392 E 258 
20| — | 1837 | 1-352 | 1120 | -884| .753 | -656| -581| .522 | .433 2 к 
.982 
7 к 4474 | 0354 | 02 
8| — | 2012 | 1480 | 1-171 | 0-967 | 0-824 | 0-718 | 0-636 | 0-571 | 0-47 i 306 
30 — | 2186 | 1-608 | 1-271 | 1-050 | -895 | -780 “691 | -620 515 boe 1330 
35| — | 2359 | 1-735 | 1-371 | 1-132 | -963 | -841 "745 | -669 25s | da | «ok 
| 40| — | 2532 | L862 | 1.470 1214 | 1:035 | -902 | -799 "MT 595 aas | 2378 
45| — | 2710 | 1-988 | 1-569 | 1-296 | 1-104 | -962 | .859 | 2764 Eon боз | A0 
50| — | 2877 | 2113 | 1-668 | 1-378 | 1-174 | 1.023 | 1906 | 813 | 67 E 
-632 
5 Б : * 64 | 0.793 | 0 50 
10 | Т17 | 4578 | 3-353 | 2-644 | 2-183 | 1-855 | 1.618 | 1-430 | 1 384 | 1-0 :79 85 
15 | 9:84 | 6-266 | 4-581 | 3-610 | 2-978 | 2-533 | 2-203 | 1.949 | 1.748 1 42 1 1-083 
20 | 1215 | 7-404 | 5-803 | 4-570 | 3-768 | 3-204 | 2-784 | 2.403 2-207 | 1-828 1.641 | 1-300 
25 | 15-15 | 9-622 | 7-021 | 5:526 | 4-553 | 3-869 | 3-363 | 2.971 2.665 | 2.207 1.920 | 1:528 
30 | 17-80 |1150 | 8-238 | 6-481 | 5-337 | 4.533 | 1.050 | 3.404 3-121 | 2-583 92 Е 
2.4 
50 | 2840 11798 13:09 |10-29 | 8464 | 7.184 6.238 | 5-505 | 4-938 | 4-082 | 3:031 
S 4; 
Table 9. Upper 1% points of the sum of four non-null roots, U® menit 
i : Ka a аман алалы 
100 
TU & 15 20 95 30 35 40 45 50 60 80 
m mae 
Е 0-163 
-05| — 213 | 9908 | 0687 | 0-565 | 0-480 | 0-418 | 0-370 0-331 | 0-274 0:206 191 
0-0) — | 1-413 | 1-028 | .807 | .664 565 | -491 | -434 | .389 | -322 -240 | 919 
05| — | 1:620 | 1-177 | 4995 | .761 "647 | -562 | .497 | .446 | -369 | -275 .246 
10| — | 1-824 | 1.325 | 1.040 | 2856 ‘727 | +632 | -559 | .501 | -415 | -309 .912 
| 18| — |2025 | 1470 | 1.154 | .949 "807 | -701 | -620 556 | -460 | -342 | 7599 
20) — | 2224 | 1-615 | 1-399 | 1.042 '885 | -769 | -680 | .609 | .504 | -375 
.325 
25| — | 2421 | 1-758 | 1-378 | 1.134 0-963 | 0-836 | 0-739 | 0.669 | 0-548 | 0-408 0:350 
30| — | £618 | 1.899 | 1-489 | 1.225 1:039 | -902 | .798 | .715 | .592 | -440 -316 
95| — | 2-813 | 2-039 | 1-599 | 1.314 L116 | -969 | .856 | .767 | .635 | -472 401 
40| — | 3-008 | 2180 | 1.708 | 1.402 1:191 | L034 | .914 | .g19 | .678 :504 | 496 
451 — | 3-202 | 2-319 | 1-817 | 1.493 1-266 | 1-100 | 972 | .870 | 1799 535 | 451 
50) — | 3-395 | 2-458 | 1-925 | 1.581 1341 | 1-165 | 1.029 | .922 | .763| -567 á 
| .69 
10 | 8-57 | $305 | 3-827 | 2-991 | 2452 | 2.075 1-803 | 1-591 | 1-425 | 1.179 | 0-875 ШҮ! 
15 | 11-66 | 7-189 | 5-177 | 4-088 | 3.309 | 9.802 2-430 | 2-144 | 1-919 | 1-585 | 1-175 1.169 
20 | 14:74 | 9-065 | 6-520 | 5-079 | 4-159 | 3.521 3-050 | 2-690 | 2.406 | 1.986 | 1-472 1-403 
25 | 17-81 |10-94 | 7-857 | 6-115 | 5-004 4233 | 3-666 | 3-233 | 9.899 | 2385 | 1-767 1.63 
30 | 20:88 |1280 | 919 | 715 | 5.85 | 494 440 | 3-4 | 3.37 | 278 | 2-06 
.55 
| 50 | 3813 |2026 1451 (11-27 |920 | 7.77 972 | 5-91 | 599 | азб | 322 |25 
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TABLES FOR WALD TESTS FOR THE MEAN OF 
A NORMAL DISTRIBUTION 


Bv ELIZABETH D. BARRACLOUGH np E. S. PAGE 
Durham University Computing Laboratory 


1. INTRODUCTION 


If vis 
wis Я к : и 
а, normally distributed random variable with unknown m! 
пе hypothesis 9 = 0, 


devit 
22 с, the Wald sequential test for tl 
с 1 (б > 0,) is specified as follows (Wald, 1947, pP- 117 et seq.): 
ontinue taking observations 1; 28 long as 
0,—0, $ п (08—08) ak а) 


a cm vasta 
а? ES i" 20° 
atisfied, stoP sampling and accept 0 = 0, similarly 


ean 0 and known standard 
against the alternative 


ift Р 
= left-hand inequality first fails to bes 
pt 0 = 0, if the right-hand inequality fails. . 
tting the cumulative sum Ex; оп 


In practice the test is usually carried out either by plo В ) 
à are drawn and stopping sampling 


а ў А | 
: oi on which the parallel straight line boundaries 
im one of the boundaries has been crossed ог by comparing the cumulative sum after each 
и вы and rejection numbers calculated from (1). 
2 
e y, = ui- 05090 " 
һе inequalities (1) may be written 2 M 
и 2 Ў һ-2. 
ізі 
Where Z ac T (b— d) с (4) 
= 70,-% 0,-% 
jth unit standard deviation and mean 
(8) 


T ; р 
he variate у; is normally distributed Y 
ni = {20 = (00+ 0.20, 
hen the true mean 


Where E(x) = 0 
Let P(Z | и) be the probability of accepting the pypothesis thatÓ = e Ей екіде 
as a particular value 0, and п is given by equation (5). The ome са ана 
Fa noambiguity is posito The derivation of the equation i 7 а bs lin 
чы considers expectations conditional upon first observa jor , 
18 case yields е s Lg p} äs (6) 
Pla) (an)? exp - 


P(Z) = o(-4—2)* Г. 


Whe 
s o) = |. 

б eof accepting 9 = 0, when E(x) = о 
robabilit/e? : of the Appendix enable h and 2 


р = 
and [4 = +0, 6o, Р( | №) gives the P e 
n mde aae The dae у TUE pilities- The constants of the test, 
» а selected to give specified values for these two PP еер 
d b, can then be calculated from equations (4). Т 
alae 
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isti se tabulated, it 
hi ired values of the operating characteristic are not among those t = РЕ 
» > ча ovided. 
Б * це т" for practical purposes to interpolate by eye in the charts — ore 
1 У $ » range 
Ja dr the tables were those requested for a particular application, but the rang 
value: 


yw gene inati ха i btained 
is sufficiently wide to be of more general use. Further combinations of and // can beo 

issu ) i 

by using the relationships 


P(h-Z| — 340, —0,)|0) = 1— P(Z | (0, —0,)[0), 
P(&—Z|30,— 0/7) = 1— P(Z | — M0, —6,)0). ғ 
ive the average sa 
Tables 2a-d, of the Appendix, computed from equation (9) below, give the average 8 
numbers, №_ (when 0 = 0) and N, (when 0 = 0,) of each one of these tests. 


2 


2. 


THE METHOD OF USE OF THE CHARTS AND TABLES 

(а) To select a test with given risks | -— 
Ifa test of the mean of a normal distribution with standard deviation ø is required ме 
probability 1 — of accepting the hypothesis 0 = 0, when 0 = б, and probability / 


В 5 uv I found 
0 — 0, (so that the risks of the first and second kinds are 2, |, respectively) it may be 
as follows. 


Calculate jt, 4, where 


(7) 
Ho = -/ал(0-04/2с. 
Charts are given for 


for 
РА es 
= 0325, +05, +075, 51-0, and there are curv 


.99 
А = 0-95, 0:99; 
P(Z| и > 0) = 0-05, 0-10, 0-20, 0-30, 0-40, 0-50, 0-60, 0-70 and P(Z |u < 0) = 0:95 
0-995, 0-999. 
Example 1 


T. 

To select a test with P(Z | 0, = 3-0) = 0-99, P(Z|0, = 3-15) = 0-05 when с = 0:5 

We have № = — а = (3:0—3-75)/9(0-5) = —0-75. 

From Table 1c for д = %075,һ-- 3-88; Z = 1-49. tions 88 
Hencea = —2-13,b = 3-69 from (4). The required test is therefore to take observa: 

long as 


— 0-71+3-375% < Уа; < 1.234 3:375. 


(b) Operating characteristics jy bY 
Several points on the operating characteristic (О.с.) may be obtained approximate е 
visual interpolation between the curves on the charts for the different values of /- 
Example 2 
The approximate o.c. for the test s 


in which tb? 
elected in Example 1 is shown in Fig. Lin w 
circled points are those read from the charts. 


(c) Average sample number 
The average numbers of observation 


is that specified by either hypothesi; 
Example 3 


he mea? 
s required for the tests given in Table 1 when t 
в are shown in Table 2. 


The average number of observatio 


ns required by the test 


30% 
of Example 1 when 9 = 
N = 2-87 and when 0 = 3-75 ig N — 4-00. 


___ за НЕНИ 
ыы 


ELIZABE' 
;ABETH D. BARRACLOUGH аху E. S. PAGE 


са n 


5 4075 410 


4.025 +0" 
xs defining points of the test. 


-10 -075 -05 -0?5 


Fig. Р 
g. 1. Operating characteristic. 


x 
OO, points read from the charts. 


For a giver 3. CALCULATION OF THE TABLES AND CHARTS 
д pair of v. 
method (Page ji at values of Л and д the solution P(Z) to (6) was computed by an iterative 
was вайб 1 9540; Good, 1957). The known function, the normal distribution function, 
4x 10-8 tia by a rational function ар i ings, 1952) with an error at most 
ture нера containing the unknown $ aced by & Gaussian quadra- 
нн” the smallest number of ordinates that would give 8 fairly accurate 
n 

n 

P(Z) = Ф(—— Z)+d wP) EG 2), (8) 
i=l 

abscissae; and K (v. Z)is the kernel of (6). 
r Z= ti = starting from an 


s weights and 
il successive values of the 


where 1 
PA T 
Equation (8) w respectively the Gaus 
Initial a b age used iteratively to obtain P 
P(e) all ттен гче (usually zero). The iteration 6 0 й 
deir ed by less than а small quantity- А ation (8) was applied for 
3? шш , v . 
а go. п’, where у; are the abscissae of 8 Gauss qua ature formula with more 
‚т >n. The P(y;) 80 obtained forme һе starting уа ations with the 
H , until the Р(2) values at a seb 


mon 
e асс 
urat 
e formula. 'This process continued automatic | 
; ру successive quadrature 


of Gaus: 5 
гуи оня Were reproduced to а presen ed ас 
to obtain e а of P(Z) at other points were cal ated and use 
in (8) and > imated Z values for the given Р(2- These estimate 
Some of iy cating the process. 
е finite AS pivotal values of the PI 
Vented atta; erence methods of Fox & 
Correspon Fate of the same accuracy of evaluation of th fourth 
Ose ation pivotal values differ py more than one unit in the 10 т 
е Зен by the first method. 
ge sample number N(Z 
N(Z) = i (е) (ge^ 
9 oted in Table 1. 


Whi 
ch ұу 
as solved by the first method for the tests 90 


equ 


fined by substitution 


(Z) were checked ndel 
odwin (1 ) Limitation: 
i o integral; " 

rt 


) satisfies the € 
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4. WALD'S APPROXIMATIONS 


-— itten 
Wald's approximation P*(Z) for the operating characteristic P(Z) of the test may be RAN 
P*(Z) = (е —e-?P2)](g-*hh — 1); - 


лені ; : i hat 
accordingly the test calculated from this approximation to give risks о, J, i.e. so th 
Рқ2|-ш)-1-а, P*(Z|p) = В 


1... Tew 1, (1—a)(1—/) (11) 
Tas Z 2; log Bo h= Bu log ар Қ 
Wald also gives ап approximation N*(Z ) for the average sample number of the test. 
12) 
N*(Z | и) = [h(1 — P*(Z|u)) — 2]. | 
Thus for the test above (13) 
N*(Z| —p) = (Z—he)/p, (14) 
N*(Z | и) = (h(1 — 8) – и. 


A comparison of P*(Z | д) and P(Z | м) for one value of ^, two values of Z and а аш н 4 
was made by Page (1954а). The present table allows a more extensive comparison. tests 
table 1 shows the Wald approximations 25; f* to the risks calculated from (10) for eU 5 
with risks «, /) at means + д. For jj = 0:25 the approximate risks of the second kind, / | € 
about one-third greater than their correct values, while for д = 1 they are more than t oit 
times greater. Further, for и = 1, no approximations are possible when a = 0-001, f = roit 
for example, since Z < 0 and (10) produces a result greater than unity. A similar behav? ea 
is shown for risks of the first kind. It is clear that for any work requiring an accurate asses к 
ment of the risks for и > 0-25 the Wald approximation cannot be accepted. Further, m a 
the Wald approximations exceed the true 0.0. Гоги < Oif Z is small; these are not likely 
occur for the most frequently used schemes with small æ and f. 


Text-table 1 


Wald wald 
j h 2 True Wald True ASN. pon 
о.с. о.с. A.S.N. with A АЗ 
Wald o.c. | true 9: 
m odii 
—0-25 11.77 0-37 0-999 0-9994 4-32 14 18 
— -25 14-90 540 995 992 23-84 214 254 
— -25 10-61 5-31 -95 -934 21-46 184 dis 
4-035 | 1177 0-37 0-60 0-831 16-99 6-5 174% 
+25 1490 | 540 05 087 | 37.49 34:0 35:0 
n 10-61 5-31 -05 “066 21-46 18-4 191 
! 
—10 | 381 0-48 0-999 0-9979 149 0-5 e 
E. 1-72 0-86 -95 -848 1-63 0-6 sis 
41-0 3-31 0-48 0-10 0-382 3-94 1-6 = 
41-0 1-72 0-86 -05 -152 1-63 0-6 e 
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7 ы Comparison of exact and approximate risks 
iem bes ы 
above comparison shows that tests selected by the Wald formulae to have specified 


hose specified by amounts depending largely 


risks will i 
ill in fact usually have risks less than tl 
o make an extensive comparison of the exact 


| mes. aem mean. It would be useful t 
і.е. those NEA " ный (a.s.N.) of the exact and approximate tests with specified risks, 
computation 2 4, chosen from (6) and (8), respectively. This would, however, entail more 

e test with valu кет is warranted. Instead we compare the approximate A.S.N. for the exact 
alues h, Z taken from the tables and the approximate A.S.N. for the approximate 


test with y 
iss values Л, Z given by (9) with the true values of the A.S.N. 
approximate A.s.x. using the Wald approximation to the o.c. is al 
frequently considerable. If, however, 


T t s 
| v s M for the exact test and the difference is oweve 
ғ too small “Сіз used in (12) а better approximation to the true A.S-N. iS obtained, but itis still 
T eon es most cases and in particular no such approximation. is useful if Z «0. For 
approxim us approximation seems to yield an error of less than 10%, but as x increases the 
ation deteriorates. 
A.s.N. have been 


| Im 
proved approximations to the 0.0. and 4 
d more recently by 


ways smaller than 


derived by modifying the 
asimilar method by Kemp 


integra s 
on ee (6) and (9) (Page, 1954a) an 

: in the examples given, Kemp's approximations to P(0), №(0) are usually nearer to 
are closer approximations fora range of Z near the 


Ne true у, 

| middle Lt than Page's* but the latter e ішкің adi 
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APPENDIX 
'Table 1 
" 
0-999 0-995 0-99 сн 
h Z h 2 һ 2 h м 
(а) и = 0:25 
Ё 2 -0:19 
0-70 10:80 | —0-03 7-57 | —0-04 617 | —0-05 2:81 0-24 
-60 11-77 0:37 8-55 0-36 715 0-35 a8? 0:67 
-50 12-63 0-78 9-40 0-78 8-01 0-76 4-70 1-15 
40 13-45 1-25 10-23 1-24 8-84 1-23 se 1.73 
0:30 14-22 1:83 1112 1.82 9-72 1:81 6:42 2.53 
:30 14-98 2.64 12-19 2-63 10-80 2-62 750 3.92 
10 = = 13-78 401 12-42 4-00 oa 5-81 
-05 = — 14-90 5-40 13-87 5-39 10-61 
—— 
(b) p = 0-5 
_ 0:95 
0-70 4:49 — 0-71 2-79 — 0:72 2-08 — 0-74 0-30 Dad 
60 502 | —039 340 | —0-40 269 | —0-42 0-97 0:19 
:50 5-54 — 0-09 3-93 — 0:10 3.22 — 0-1 156 70-15 
40 6-04 0-92 4-49 0-99 3-79 0-21 2-05 51 
0:30 6-55 0-57 4-93 0-57 4-93 0-56 2.57 051 
-20 718 1-02 5:51 1-02 4-81 1-01 3:16 1-68 
10 7-95 1-73 6-34 1-79 5-64 1-72 3-99 9.37 
05 8-69 2.42 7-08 9-41 6-38 9-41 473 
-1 т ені 
(c) д = 0.75 M 
= 
0:70 211 | =143 1-01 | -145 0:56 | -148 = йы 
60 2-59 | —0.82 L50 | —084 103 | 0:86 = 0:69 
50 3-02 | —0.55 1-93 | -056 146 | —0.57 027 | "37 
40 3-42 —0-27 2.34 — 0:28 1-87 — 0-29 0:71 Е 04 
030 |. 383 0-03 2-15 0-03 9.98 0-02 114 |7 0.35 
-20 4-98 0-40 3-20 0-40 2-74 0-39 1:62 0:90 
10 490 0-94 3-82 0-94 3-36 0-93 2-25 1:39 
-05 5:42 1-42 4-34 1-42 3-88 1.42 2.78 
(d) м = 1-0 
0-70 0-85 | —145 = ХЕ ай = 3 
-60 L26 | —116 047 | —190 " = 
0-11 =Ñ — же 
-50 1-64 | —0-90 0.83 | —0-92 0-47 2 E = 
40 2-00 — 0:63 1:19 — 0-64 0-83 — 0-66 EP pas 
0:30 236 | —0-34 L55 | - 0-35 1:20 | —0-36 0:25 | Z 0.07 
-20 2-77 — 0-00 1:97 — 0-01 1-61 — 0-01 0-69 7044 
10 3-81 0-48 2-51 0-48 2-16 0-47 1-27 0-86 
05 3-75 0-89 2-95 0-89 2-60 0-89 172 NEL 
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Table 2 
Е 
в 
А 0-999 0-995 0-99 0-95 
P № а | 
+ N. N, м | х, N. N, м. КА 
(а) и = 0:35 
“2 3-15 12-00 3-00 8-16 2-84 6-53 2-00 2:92 
.50 4:32 16-99 4-15 11-86 3-07 9-67 2:97 472 
= 5-75 22.34 5-56 15-92 5-37 13-17 4.23 6:90 
7-53 28-00 7:32 20:31 7-11 17:00 5:84 9:41 
0:30 9-82 33.72 9-61 25-05 938 | 2118 797 | 1227 
20 13-06 39-10 12-83 30-17 12.58 | 9574 | 1100 | 1551 
“ШІ - iis 1283 | asso | 1806 | 3082 | 1692 | 1027 
55 — == 23-84 37-43 23-54 33-51 21-46 21-46 
0) p = 05 
p 199 | 3-35 1:95 2.39 121 199 14 nn 
0 1-49 25° ET 8-25 141 271 118 
- 9 4:53 1:45 1:41 1:97 
a0 1-77 5-83 1.73 4:22 1.63 3:53 : 7 
40 2-16 7-99 211 5:29 2.05 4-47 1:74 2-57 
0-3 р ae ^ В :50 2.92 3-27 
D 2-69 8:71 2-63 В] 848 Let 2-96 4-08 
E 3:48 10:32 3-42 (272 jen 5 i 4.26 5-02 
2 à 4-72 7 ; 02 
4-86 19:06 4:80 9:16 510 8-63 5-58 5-58 
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TABLES OF RANDOM OBSERVATIONS FROM 
STANDARD DISTRIBUTIONS 


Bv M. H. QUENOUILLE 


Research Techniques Unit, London School of Economics 


PURPOSES OF THE TABLES 


The recent increase in the popularity of Monte Carlo methods has been accompanied s. 
a corresponding increase in the supply of random observations. These are now available n 
only in the original form of random numbers (Kendall & Babington Smith, 1939; Tippet^, 
1927; Rand Corp. 1955) but also as random normal deviates (Wold, 1954), correlated random 
normal deviates (Fieller, Lewis & Pearson, 1955) and serially correlated random numbers 
and normal deviates (Kendall, 1949). 

From these published tables, it is easily possible to draw random observations from any 
distribution, the only steps involved being the calculation of the distribution function ай 4 
the transformation of rectangularly distributed observations using this function. Neverthe 
less, these two steps may require considerable calculations, and it is therefore сопу 
have readily available sets of random observations from some standard distributions. 
are provided in this table. 


These 


CONTENTS OF THE TABLES 

For Monte Carlo methods where the purpose is to investigate the sampling effects of p? 
partures from the normal (or any other standard) distribution, it is likely in most instan? " 
that the greatest information will be obtained by relating estimated values for the nor 
normal observations to the corresponding values obtained with the same observato 
transformed to normality. The table has therefore been arranged so that the values in e? 
line provide identical percentiles for each of eight different distributions each with 8 me 
of zero and a second moment of unity. e 

One thousand random observations from each distribution are provided, and these Э 
arranged in twenty groups of fifty. The basic data for these have been drawn from the 
two pages of Wold's (1954) Random Normal Deviates, and are given in the first colum: ш 
each page. Тһе further columns have been derived as follows. 


(а). 2, random rectangular deviates from the distribution 


Хаз) ах = sate =43 < ta S 43. a 


These were obtained by transforming the normal deviates, x by 
MU 


= 24/8 LP Jg — dei da, 4]. 


Б 2 t + P 
In this calculation, the expression in the bracket, was carried to four decimal place? » 
multiplied by 3-4641. The answer was rounded off to two decimal places. 


enient fO : 
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(b). ;, random deviates from a distribution whose logarithm was normally distributed 


exp — 3 [log, (2: +а) +b] dra, (> -а) (2) 


1 
(23) ах. = = 
Talis jns a) 
Bess а = (e— 1)-3 = 0-76287 and b = 0:5—log,a = 0-771. These were obtained by trans- 
orming the normal deviates, т, by 
аз = (en— еъ) (e — e) = 04627165 — 0.76287. 
2m Bives a lower limit for a of 0-763 and a modal value of — 0:593. In making this trans- 
»rmátion, €^: was calculated to three decimal places and the value of а; rounded off to two 
decima] places, 
(©). а, random deviates from the exponential distribution 


Т flay) day = еб, (>> —1. (3) 
nese ere obtained by transforming the normal deviates, 2, by 
1 ехр – айх ] -1. 
a, = —log, [E Ten) exp — 111021 
“а, s, 2, random deviates from an Edgeworth Туре А expansion 
Е (4) 


exp — ja? de, 


K; Ка pA 
f(x)dx = exp [52+ 2 р] A7) 
қ са. es OF = 0:5. 
е. . „ — 0:5 nd with жу: ку = 0*5, ка 0 
з = 0:5, к, = 0-0, with ay: Қат 0:0, к: = 098 P ich oi fficient degree 
These have been calculated мея: Cornish-Fisher ee ИЕ used, direi 
y proximation within the central range of родни Us а 
Ubstitution in formula (6-54) of Kendall and бектің s А ud 
124,416, = — 9559 + 127,225, + 8242 — 4021 7043 77 
1536x, = 141 la, + 5623 — ӛзі, 
"T" 124,416x, = — 12,1444 122,878% +1 E 
u е Е 2. ; 
te ated values of these functions are given on page 18 will break down for sufficiently 
extre, S clear that these (or any other polynomial) коте of 2; reach a minimum near 
"d М гапе! А : 
"ES Ne values of x, and the fact that tabulated va^ beyond this point. To 


DUE S : < expansion for 2 : 

Dre Pte Indicates the unreliability of the series сұқ? шысы! ера на 

bii i ionshi reen 4 ane Ae by the minimum 

ГЕ More .. oca нее 2-76 i 2.69) have been se и 
en = (— 9:79, —2/16, —9 09). gpl қ 

ач От ж, —1 8 Ties al е have been НИЕ 

че; d 56. These repla А гар — 1:82, — 1:84. 

5, should they be needed, are respectively =] 8L. that values of cumulants of the 

distrig, Tther limitation of the polynomial exP d »serted in the Edgeworth expan- 

Sion, mp On will not exactly equal the values of xs and ка d by taking expecta- 


te 
ты E ilants (evalua 
tiong "5 the true values of the third and fourth TE 
Powers of the polynomial expansions) cal 


With dk 


43042? — 106623 — 72091 + 26125. 


| | : 
2% 
2% 
= 
E 
0:00 
0:47 0-45 


Third cumulant 0.01 
Fourth cumulant | ACE : 

ni 4 below- 
Pearson 00 РР” m = 


* See note by Е. 8. 
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(e). ax, random observations from the two-sided exponential distribution 
ы 2 т (5) 
Jedd = gs ex (Ваз, (- < 25 < co). 


These were obtained by transforming the normal deviates, ә, by 


‚ 78g gna, " B cp — la2 dz, 4- log. 2]. 


places. rovide 
. 3 x Tr jr! 
All calculations were repeated (in many instances by the same computer) to 1 


jth card 
a check. A further check was provided by punching cach row of results on a Hollerith са 


к be 
and ordering the set of these according to values of x,. All other variables should then 
arranged in order. 


Е З ; ‚ ог five 
This check revealed (apart from a number of errors in punching the cards) four О 
omissions of minus signs. No gross error was found. 


PROPERTIES OF THE DISTRIBUTIONS 


: z ; Й erved 
Тһе theoretical means and variances were, of course, zero and unity. The over all obs 
values showed close agreement with these theoretical values: 


24 2% 2% 


| "о decimal 
The values were caleulated to four decimal places and rounded off to two d | 


Mean 0:00 | 0-00 | —0-02 — 0:01 | 0:00 | 


0-00 
Second moment about origin | 0-95 | 0-98 0:81% 
| 


0-00 
0:95 | 0:93 | 


e 


00 
0.93 | 0-96 | 0:95 


7 : 1 the 
Theoretical values of the cumulants of the different distributions are show? Ы, 
following table: 


| | те | 
| 521 Ta | 2а 24 | а To | 24 | 28 | 
| 
| | j 
к, 0-0 | 0:0 | 6-2 2-0 0-5 0-0 | 0-5 0-0 | 
Ks | 00-12 110-9 60 | 00 | o5 | o6 3:0 | 
Ki 0-0 | 0-0 5154-7 24-0 0-0 0-0 0-0 0:0 
Kg | 0:0 | 6-9 | 617375 120-0 0-0 | 0-0 0-0 30-0 
| 


TM M" ing 
Тһе variations in the forms of distributions are fairly well demonstrated by the follow 
frequency table for the whole group of observations: 


40 
H x d» 
* Owing to the high value of x, for this distribution, thi а 


+ вр 
> 2 8 seemingly extreme value corre 
& two-tailed probability of greater than 0-5. gy extrem 


= 


М.Н. QUENOUILLE 181 


Value. ^ 7 
: s of the transformations for the Edgeworth Туре A distributions 


т. ж 
DO" 1 Та тз т, 2% Te =. 2, 
8:00 to 8-09 
7-00 to — 7.99 1 
6:00 to 6-00 2 | 
hen to 599 1 
44 to 499 0 3 
E to 3-99 В 2 ; 1 
22000 540 | 2 4 : . 1 3 
zur т P rf 2) fl Seas 
Pato витр | : 3 | 4 2 3 1 5 
00а: mw | 3 4 5 1 7 13 
1-60 to n оч А 5| 23 10 u 8 : 
1-90 to Pe | 29 34 20 18 33 27 31 M 
08040119 62 117 18 43 60 58 58 40 
040 to 0.19 100 122 43 59 87 86 79 70 
E pog io 10:90 131 117 70 78 120 136 116 124 
7940 to — 0-01 156 108 128 121 145 169 146 216 
7080 to =p 152 120 296 182 154 162 167 210 
= 1-20 to — 0-31 150 110 397 293 154 154 174 152 
1580 to =a | 96 141 : 158 117 86 112 55 
249019 — 1-61 48 95 Ж 4 66 44 56 82 
Е 240 to _ 9.9 30 36 ” Ы 38 27 26 21 
280 to _9 1 20 Н 4 р 17 5 12 
3.95 9 --241 д | 
3.20 6 + 9 2 12 
Sgan 281 : 
d 60 to 3-91 Б . 5 
*00 to 3-61 è E 1 
* — P Ami o a 
TI 
о 115-47. The two end groups have 


xpeoray Хресь 
сеа oa pog in the middle eight groupings here ar 
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182 Frequencies of the sets of 1000 random observations 
: [— 9552 + 127,225a + 78242? — 4023 + 576211 — 25245) 
(а) 25 = 134416 = 
" т; 
m 2% 2 я; z 2% 2 Ts % ы 
к F 2.221 
3.9 n 1-9 1-680 0-6 0-007 | 07 | 0670 2 0 vent 
ва -18 | -16) | -05 | -0572 | 08 | 0782 | 21 | Zir 
NT =1т | -1564 | —04 | -0476 | 0-9 | 0-896 TE 
—2.9 -16 | -1499 | -оз | –0378 | 10 | ront | 2 Mn 
-28 —1-5 | -1429| -02 | -о0979 | 141 1127 zs 2.851 
-27 1-4 1-356 01 0178 | 12 оа | oS оет 
-2.6 -r3 | —1-278 0 —0-077 | 13 1:364 | 26 
2-7 3-092 
-28 | -1857 | —12 | —1-198 0-1 0:026 | 14 | 1484 ат 5.208 
-24 | —r849 | —11 | —1115 02 | 0130 | 15 1-606 28 | 3.302 
-23 | —1-832 | —10 | -1-099 0-3 0336 | 16 | 1-728 Ж jum 
-223 | -1806 | —09 | —0-942 0-4 0342 | 17 1-852 i 3.536 
— 2-1 -1-771 -0:8 — 0-852 0-5 0-450 1:8 гая de 3-635 
ы 1 . . .2 
-20 | -149 | —07 | —0-760 0-6 0-560 | 1-9 jn 
1 
%----- [14110 + 5623 — 325] 
(b) ® = 1561 
+ £s +2, +2 +% 
0-655 14 1:376 9.1 9.187 
0-753 1-5 1-486 2.2 | 2.309 
0-852 1.6 1.599 23 | 2431 
0-953 1-7 1-713 2.4 | 2-553 
1-056 18 1.829 2.5 | 2.675 
1-160 1-9 1-947 2-6 | 2-797 
1:967 9.0 9.066 


N.B. Negative values of x give negative values of 2%. 


Mare iil 12,144 + 122,878 + 14,3042? — 106623 — 7202: + 26125] 


т 2% т 2% т 2% т 24 i 
502 | жч | -19 | 168 | 068 | iig | or | обв | 20 
CE | 934 | —18 | 512668 | —06 | —0582 | os | uoo | 91 
езі LBS | 44 | 1б | ыбы | сым | ou | gem | № 
238 | “ӘМ | -b6 | -ra | 5 | —es | а | pem | м 
-28 | -24% | -15 | —1397 | —02 | бору | 11 1.111 2-4 
еРІ | -22Ш | -44 | -1265 | «d | ож | 32 | ұша | 28 
-26 | —2.251 | r3 | —1193 0 -0-098 | r3 1:353 2.6 
-28 | —2445 | —12 | —1420 0:1 0-002 | 14 | 1476 | 27 
-24 | —2046 | —11 | —1045 0-2 0104 | r5 | 1-600 | 28 
-39 ЭВ -19 | men | $$ | оба | va | peg | 04 
-29 | —L866 | —09 | -0-899 0-4 0-315 | 17 | 1853 | 3-0 
-21 | —1-784 | -08 | —osi3 0-5 0424 | rs | ros2 | 31 

2.0 1-704 0-7 0-731 0-6 0-534 | 19 | 2112 | 32 


TABLE OF STANDARDIZED RANDOM DEVIATES 183 
First fifty 
ү та E 2, ау 2% = Tg 
= 
i (бе [epos | oos; | some [059 | ze |=08 печ 
2 1-37 1-44 1-06 1-46 1-45 1-34 144 1-25 
3 | —018 —0-25 us | -pa |0% | OUT | -027 |5 0-11 
4 0-35 0-47 | —011 0:01 0-29 032 0-26 0-23 
5 2.89 1.79 7-00 5:03 3:23 3:06 3:42 3:78 
6 2.12 :67 j 3-07 2-38 2.21 2.40 2.39 
ғ pe ME и 20989 | —102 |-094 |-096 | —080 
> Ex gc 5 Т 9 
3 0-34 046 | —0-11 0-00 0-28 0-31 0-95 0-22 
3 5 ў 0-60 0-60 0-58 0-46 
0:64 0-83 0-11 0-34 Hn na 
ш 0-11 015 | —035 | —0-22 0-04 0-10 
— 0-02 —0-12 — 0:01 
П pm | ерш |-ӨШ |- он e "a EC EN 
4 |-%6 [| Oo Іп | uz 1; 
: 001 ух gs ў -00 —013 | -002 
» -003 |-o04 | =031 |-993 00 P -048 | —037 
а — 0:41 — 0:55 — 0:46 — 0:58 = 
16 0:32 0-58 0:58 0:56 0-44 
ІТ 0-62 0-81 0-10 hire 0-82 0-78 0-79 0-64 
18 88 1:09 @30 000 | ова |-04% | 080. | -008 
ip | бв — 0-98 — 0:55 E — 0.05 0:03 | —0-07 0-02 
P 0-03 004 |-o2 ]|-9299 |71; |-249 |-195 | —278 
— 2:30 — 1.70 — 0:72 — 0:99 и NX 
21 .52 0-76 0-73 0:74 
22 0:78 0-98 0-25 eo 0-63 0-62 0-60 0-48 
23 0-66 0:85 0:13 ЖЕТЕР -049 | -022 
л —0-34 —0-46 — 0-43 Ei 1-20 112 118 0-99 
25 1.16 1:31 0:71 n 0-74 0-71 0-71 0-57 
а 0-76 0:96 0-23 0:5 роз |-095 |-097 0:81 
в Iano [rr | ово |-98 |75 |әм Е — 0-20 
27 | —032 048. | 049 |098 | бду 017 0-08 011 
28 A -015 01 | —0-02 0-05 
99 0:18 0-25 — 0:21 0:24 0-01 0-07 Eos 235 
30 0-08 0:11 — 0:26 = 73 2.20 2-04 f. 
1-98 1.65 2.59 2. 0:86 0:89 0-72 
31 0-71 0:91 о | -040 | –020 
39 0-91 1:10 0:39 0.53 _ 0:40 iei Bis 21:81 
33 — 0:39 — 0:43 — 0-43 098 1:61 pr 006 0-10 
34 — 1.77 — 1.60 — 0:68 017 0-09 Е 0:75 0.60 
35 0-16 0:22 — 0:22 Е 0:77 0- 
s .26 0-54 Р 0-10 0:12 
0.79 0-99 0-2 3 0-18 
6 _013 01 1-28 1:37 117 
37 0:20 0:27 | -020 1.36 1.38 032 0-26 0-23 
38 131 1-40 0-95 0.01 0-29 or 0-69 0-55 
39 0:35 0-47 — 0:11 041 0:72 0.25 0-18 0-17 
40 0:74 0:94 ол — 0:07 0:20 1:37 1:46 1:28 
4l ig par, [e 1.50 147 gs: |023 ы: 
42 1:39 1-45 1:09 041 — 0:22 Lon —0-78 — 0.57 
43 — 0-14 — 0-19 — 0:36 7075 — 0:82 m 0-28 is 
44 |7076 | —0.96 | 058 0.08 031 | зд |2194 -2 
45 0-37 0-50 | —0-09 б = 1:83 0:27 0-23 
в | 7228 | -16 | -072 езу 020 EM ud 1-25 
4; | 0:36 jay |-ыш | б | ME X Ne 
48 137 1-44 1-06 00 1:06 063 | -07 Eum 
49 1:04 1:22 0-55 0: ; ж 0-30 20 
50 — 0-67 — 0-36 _ 0-53 
м = 0-53 — 0:08 
Та) 
(оз) 10-48 11-51 8-42 
кс | “чш 46-6287 | 76-3842 


184 TABLE OF STANDARDIZED RANDOM DEVIATES (cont.) 
Second fifty 
| | | | В ES 
жү та то ч "e in + 
1 0-68 0-87 0-15 0-39 0-65 0-64 pis d 
2 | отт |-097 |-055 |-015 |-0s2 | -072 |-о79 | 75, 
3 |—139 — 145 — 0-65 — 0-91 — 1-35 = 1:37 — 1:26 9:04 
а |-191 |-L63 | —0.69 0-97 1-63 1-96 ми у= 
5 0-05 0-07 — 0:98 — 0:27 — 0:03 0:05 — 0:05 0 
" 132 141 097 | 137 1:39 1-99 1-38 M 
т |-067 |-046 | -053 | -o071 |-073 |-063 | -0-71 ae 
s |-081 | -101 | —0-56 0-77 0-86 өле | -082 | о 
9 0-96 1:15 0:45 0-78 0:97 0-91 0:95 Ер 
10 | -0-24 | —033 |-040 | —048 | —0-32 | -0-22 | —033 = 
1 | -066 | -085 | -052 | -071 —0-72 | —0-62 | —0-70 aris 
12 1-08 1-25 060 | 0:97 110 1-04 1.09 MED 
13 0-27 037 | —016 | —0-07 0-20 0-25 Ола odi 
14 0-46 0-61 — 0-03 0-13 0-41 0-43 0-38 058 
15 | -077 | -097 |-055 |-075 | -082 | -072 | -o79 | -9? 
16 0-00 0-00 | —030 | —0-31 | —0-08 000 | —0-10 ee 
17 1-01 1:19 0-51 0-86 1-02 0-96 1.00 not 
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ха) 15-81 14-56 16-81 16-92 16: ; . 
I(x?) | 57-3619 | 52-9306 | 112-8177 | 89-9754 SETST bons 65.1299 
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9 1-68 1-57 1-72 2-07 | 183 1:69 
10 — 0:17 — 0-23 — 0:37 — 0-43 | — 0-25 — 0-16 
ll | -0-14 -0:19 — 0:36 — 0-41 — 0-22 — 0:13 
12 0-68 0:87 015 | 039 0-65 0-64 
13 0-18 0-25 -021 | -0-15 0-11 0:17 
14 | -1-71 — 1-58 —068 | —096 | —1-57 — 1:72 
15 | —0-28 — 0-38 0-41 0-51 0-36 0-26 
16 0-94. 113 0-42 | 075 0-94 0-89 
17 033 | 045 -012 | —001 0-27 0-30 
18 1:18 1:32 0-74 | 143 1.22 1-14 
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24 | —0-54 — 0-71 — 0-49 — 0-65 = 0-61 — 0:50 
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28 | —0-41 — 0-55 — 0-46 — 0-58 — 0-49 — 0:38 
29 1-27 1-38 0-88 1-28 1:33 1-23 
30 | —1-22 —1:35 — 0-63 — 0-88 —1-21 — 1-18 
31 1576 1-60 1-93 2.24 1-93 1-78 
32 | —0:20 — 0-27 — 0-38 — 0-45 — 0-28 — 0:18 
33 1-05 1:22 0-56 0-92 1-07 1:00 
34 | —0-95 —114 — 0-58 — 0-81 — 0-99 — 0-90 
35 | —0-16 — 0-22 — 0-37 — 0:43 — 0-24 — 0-15 
36 | -131 -1-27 —0-61 — 0-86 —142 -1-07 
37 0-97 1-16 0-46 0-80 0-98 0-92 
38 1-53 1-51 1-37 1-76 1-64 1:52 
39 | —0-05 —0-07 — 0:32 — 0-35 —013 — 0-05 
40 0-64 0-83 0-11 0-34 0-60 0-60 
41 — 1-83 — 1-62 — 0-69 —0-97 — 1-64 — 1-86 
42 | —073 — 0-93 — 0-54 — 0-74 — 0-79 — 0-68 
43 1-26 1-37 0-87 1-27 1:32 1-22 
44 — 0:11 — 0-15 — 0:35 — 0-39 —0-19 — 0:10 
45 0-02 0-03 — 0-29 — 0-29 — 0-06 0-02 
46 | —1-52 — 1-51 — 0-66 — 0-93 — 1-44 — 1-51 
47 — 1-61 — 1-55 — 0-67 — 0-94 — 1-51 — 1.61 
43 — 0:83 — 1-03 — 0-56 — 0-77 — 0-88 —0-78 
49 1-78 1-60 1-98 2.28 1-95 1-81 
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31 | -063 |—-082 | -o52 |069 | –овә |-os9 |-067 |77 
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88 | 048 |-058 | -0-48 | 0 | -o50 | оло |-050 |7016 
39 | -0-26 | —036 | —0-41 —049 | —034 | —0-24 — 0:35 ~ 4.98 
40 1-73 1:59 1:85 2-17 1:89 1-75 1:89 Je 
4l 0-64 0-83 0-52 — 0-70 — 0-70 — 0:60 — 0:68 gt 
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43 | —2-08 -1-67 = 0-71 — 0-98 — 1-76 ~2-16 = ~ 0:31 
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3 


ABLE OF 2 
I OF STANDARDIZED RANDOM DEVIATES (cont ) 201 


Nineteenth fifty 
21 | Hg т 1) 
А | 3 а, 2% 2% 2; z 
: -0:19 — 0:26 — 0-38 — 0-45 0-2 
ME р E» в — 0:27 -017 | —0-28 | —0-12 
X. conan 0-20 0-06 012 0-03 0 
2 | —102 |-056 | —0-77 7 m 
4 dS ue M 0-77 — 0:87 — 0-77 — 0:83 — 0-63 
51255 1528 1-53 " 4 -026 | -017 | –027 -011 
әке e. € | —185 | 259 | -208 | -297 
S1. % -25 59 07 7 Р 
8 li MESS —117 —0-59 — 0:82 = м = pen B E 
9 | —0-81 іе |с | ав | |-Жа =й _ 0-05 
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16 007 | 010 — 0:27 — 0:25 0-00 0-06 — 0-03 0:05 
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33 0-57 0-78 EM а —0-92 — 0:83 — 0-38 T 0-40 
34 0-98 117 in 0-26 0-53 0:53 0:50 0-79 
35 — 0:38 —0-51 а. 0-81 0-99 0-93 0-97 0:25 
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42 —0-94 ein. —0-58 —Dal — 0-82 — 0-71 — 0:78 0-15 
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(°) SZOLT | mmm желе) Я 


[ 203 ] 


Note оп Mr Quenouille’s Edgeworth Type A transformation 


Bv E. S. PEARSON 
University College London 
), by a limited number of terms of an Edgeworth 


In 
representi 
ntin қ 
pe A ean non-normal frequency function, Дт 
sion, statisticians have frequently used the form 


7 T 3 
Ла) = де" | аз 32) +54 (nt — Ga? + 3) -- 55 (a8 — 15x! + 452— 15}. (1) 
This di v (27) 6 24 72 
Istri = 
ribution, for example, formed the basis of Gayen’s (1949, 1950) papers exploring the effect of 
Srivastava in а paper 


eparture fr 
i from normality on several common statistical tests. It is also used by 
tant to realize, however, that the distributions 


Printed 1 
Oba, мер = in this issue of Biometrika. It is impor ‹ e ; l 
Renouille о т; derived from a normally distributed 21 through the Cornish-Fisher equations given by 
Seemed ofi р. 179 above, will not correspond exactly to particular cases of the frequency function (1). 

i) For iip. ND to examine the extent of this difference. AM | E 
nder t] ase кз = 0-5, ку = 0-0, f (2) of equation (1) becomes negative at v = — 2:53, the negative 
Slo . 16 curve beyond this point amounting to — 0-0028. This means that the area above — 2:53 
Or is in excess by З іп 1000. For the Quenouille transformation the critical point ваваь = — 1:86. 


US there jg ; р 
that ар tpo © inevitabl і го distributi t the lower tail. It was found 
y so "eme! „ween the two distributions à |. It was 
y some disagreement between led fS peed Pn de ior етте Кыш) 


E the К 
Pogins lo Aen tail there was good agreement up t Aem а 182 
1) ase steadily over Quenouille's value iven in his table on p. ^7** Жез 
Scarce i 9: е саво кз = 0:0, к pec б fin) for (1) pes not become negative and the two 2; distributions 
ш) oe until after a= +34. © ar 
the caso of ку = 0-5, к : ? tion (1) becomes negative 85 2% --26б: 
і = 0-5, к, = 0-5, the Дт) of equa с 
Mai ii at about " =- 3-35, but the en of this anomaly 18 much smaller л B d 
599 ог eu negative and positive areas almost cancel out, the area yar be wa v dme 
giv, Se j a negligibly in error. In the lower tail, the че a ае em. еу ue 
еј ouille’ в и — 2]; beyon ў 
tho iin dne inc зеді жотаны а а а on е һе a, of (1)- However, as the table below 
i i 8 Increasingly greater negative v alues than t 7 


3,8 di : a 0. . 
е s this a * жыра шна en mere a ee tad Edgeworth distributions of 
j given the expected frequencies 


9n ог the two авуп ; ille's two samples 
a 0), expressed іп terms of totals of 1000. These are compared with UE tion of ay. For (1) 
i E Тһе only appreciable difference is at the ewe 2 
S га, рес E Бе Е "1 2-0 to — 2°53) W 

4 i ed frequency of 8-6 in the interv al О y tesh 


hile no observations are possible in 
i тү ffit is used to compare 
fl пред n 8 Quenouille's transformation. However 
: NR bled below; 


tivo 


shown by the braces, 
S i 1 H 
thas quencies with the expectations t& 


(a) 
К 
Жы 
(5) ата к: = 0-0: д? = 6-29, у’ = 18; 
Sith 3 = 0-5, к — 0-5: y2= 15:65, Y= 14. ‘lof thea; 
dista T of t) * E lm ы -copt at the lower tail of the 2% 
л hi that excep AE 
‘bution 89 Values is significant. One must conclude therefore enouille’s three Edgeworth xr 
difference between уже М samples as large as 1000. 


tig, Uti 
ng Оп it w à 
and Ould be difficult to detect any f equation (1). 


thos Р 
© derived from the commonly used form 0: 


REFERENCES 
Barron, р. Е. & Dennis, К. E. Б. (195969 
Gavnn, А. К. (1949). Biometrika, 36. 208-25. 
Gaven, А. К. (1950). Biometrika, 7. заа = 
TS igation. Бі fs abd 

нез (1952) investi ерле. 


а) of 
ing v! 


ж 
а( Thi. 
р 18 ів 
1 to 
t тү?) pene expected from Barton & Denn 
at ig, Outside the region іп which the f( 
ifference between the correspon 
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Note on Mr Quenowille s Edgeworth Type А transformation 


Comparison of Edgeworth Type A distributions 


Case x, = 0:5, к, = 0-0, Case ку = 0:5, к. = 0-5, 
i.e. distribution of 2 i.c. distribution of 2; 
т ————— еледі 
- 
Expectation Quenouille's Expectation Quenouille 

from (1) sample from (1) samp 

> 40 0-4 - 0-5 = 

3-6 to 4-0 0-8 == HA 1 
3:2 to 3-6 1.8 3 2.5 3 
2-8 to 3:2 3:9 2 5.0 А 
24 to 28 84 T 9.3 қ 
2-0 to 24 17-5 10 16-7 3 
1.6 to 2-0 33-8 33 30-0 ? 
1:2 to L6 57-5 60 51-7 58 
0-8 to 1:2 85-8 87 82-3 fi 

0-4 to 0-8 114-9 120 117-9 e 
0-0 to 0-4 140-7 145 149-8 м 

-04 to 00 155-6 154 1647 к 

—08 to —0-4 149-0 154 159-1 He 
—12 to -08 1174 117 113-9 и 

-l6 to -19 72-0 66 66-3 2 
-20 to —16 31-9 38 28-1 2 
—2:4 to -90 8:3 — 7:5 : 

<-2ж 0:3 = oat ^ 

= n | 
Total 1000-0 1000 1000-0 1000 


* For equation (1), the lower limit for ж; is — 2-53 and for x, it is — 2:65. 
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MONOMIAL-MONOMIAL SYMMETRIC FUNCTION TABLES 


By JACK LEVINE 
University of North Carolina, Raleigh 


l. TI 
Tables қ 4. monomial-monomial (MM) Table is a continuation of the David-Kendall 
VU = 8 are X: (1951), (1953), (1955). АП monomial products up to and including weight 
isted as linear functions of monomials. Thus, Table 8, section (iii) and (iv), gives 
+4(43 1)+ s(42?) + 6(421?) 4 6(322) 
4, 8(3221) + 24(24) + 12(2°1°). 


a theorem of 


(21)2 
№ (2) = (62) + 2(612) + 2(53) + 2(521) + 2(#) 


2 Th 
1 8 x 1 H B H ж. Ж 
аоМаһо ise of MM expansions іп problems of distributions 1$ illustrated by 
n. 
эш 1 ә а 5 
is nber of distributions of n objects of specification [m Ma -+ mj in r different boxes 
> 2 5 n . + n 
ое сте Е = ch that В; contains p; objects of specification [ту Mai + ть] is given by the 
Product my +- m, ОЁ the monomial symmetric function (n ++ my) in the expansion of the 
a" = 
( Тут... тр) (түз Mag ++ Mpo) e (Myr May ++» ғ) = УС, MENOS ГАХ 
— > 
qe] of = Dg; = “Юг 


esu 
Mmati 
io 5 xis 
n being over all partitions [M+ 
ator, 


the Tables of lower 


3. С 

x Onstr . , 

Wei, truction of the T , MacMahon’s Р-орег ia 
is ТЫН стар інен n ht. Checks were obtained by multiplica- 


ti ет à ; 
“on o 5 used to build up those of higher welg 


actor: 4 
s н 

in different orders. 
aid in the construction 


The 
** these Tab is indebted to Mr Charles N. Anderson for his valuable 
es 
(00). 


f£ Naval Research Contract Nonr 870 


ork on 
these Tables was supported by Office o 


REFERENCES Р 
(1949). Biometrika, Е 1 


Davin, Е. М. & KEN M.G 

‚Е. N. & KENDALL, № 7 T ka. 
Davin, Е. М. & KENDALL, M. G. (1951). mi 40, 47. 
Davin, Е. №. & KENDALL, M. С. (1953) Be ties , 223. 
Davin, Е. М. & KENDALL, M. G. (1955). ae 


р 
206 Monomial-monomial symmetric function tables | 


'Table 2 


4 
w-2|(2 (19 
ez в Table 5 
w=s | (5) (ат) (32) (31°) G' (21° 09 
Table 3 e | 
ЕЕ ЕЕ ac 1 1 . . 
w=3 | (3) (21) G9 ino 2 x ox Хх К x 
---с:>:О 510) 1 1 Š i 
2*) (1 Y 1 1 » E 
Eri (1) ә H 2 3 š E 
21102) 1 1 2 E . 
(21) (1*) ж, 1 2 3 . 
a (12) “ 1 I . 
1) (1) I 5 | 
Table 4 Oo Гра | 
(2n) (* N I 3 4 6 6 
2)? (1 и @ 4 è 
“-4 | (4) (31) (2) (2:5) (19) (2) (12) (1) Я І í 3 z 3 
ms >= : 1 а 7 20 
| 12): (9 : 1 2 5 12 730 ^ 
(3) (1) 1 1 3 ч " (1) 1 3 4 6 6 6 M | 
21) (1) i 1 2 2 M 1) (1)? т 1 3 7 12 27 бо 
2) т P 2 5 H (1) 1 10 20 30 60 120 
me: в do: : | 
19 (1 E E 1 
s. á 1 2 $ | 
Әрізізіш 
1% 1 4 6 12 24 
Table 6 
- 
E © (б) а Gr) 09 аш) Gr) (9) (ам) @ аә 
) 
fe Е 1 5 
4 R 1 x | 
Ge 19 = » ы | 
3000) : а 8 7 1 
31 8 " 1^ Ж 1 3 | 
3) 1°) : 1 2 3 2 
33 (21) : E 2 * 2 : i 
Р 
260 1 1 . : 
23 (A 1 2 3 2 . 
2%) (1*) 3 . . 
23 5 д 1 1 y 1 : 
zl 200% i ; 2 ш 003 
= (13) x a 2 2 А 8 1 E A 
2) (1*) а x 3 . 2 4 : 
m 8 . 1 s 
E ai . . i А z Е 1 6 
ТІРІ: ee А 
3290) i 1 2 4 2 3 6 Е : 
3109.0) H : 2 2001 à қ 
25 (0: Е І 3 - 1 3 : 
215) (1)* . B 5 а 2 5 3 a 
21) (2) (1) 4 1 2 2 $ 2 6 5 10 12 
ауаза) = B a ME. 
25: a x P 3 й ә E ш 6 то 12 
а КЩ : т БВ жоо >й 3 | 
й ^ 1 2 » 2 4 2 4 
tay E 3 Е : : 1 3 + ; 
i ; с 7 ж 8 8 = B a - 
ii Ye» т алт 3 sg 3 $ ^ 9 B ж 
m D 1 2 3 2 Е E 34 24 24 Я 
Sur |: f of Y is od 4 
e | a» toe Be Еш а = 
ac. ` 4 7 2 8 16 15 15 34 5 180 
D с 6 4 4d P Zag № а 


207 
Table 7 


D ar) (17) 
з) (зч) G Ger) (310) (М) (r) (аш 
i 6 (52) (51) (43) (2) (н) в 
vw 20) () (6б) (52 
LI MANT 5.2 г | 
(©) 1 1 ; i : 
51) (1) š 1 І : 
(5) (2) 1 У ; | 
и | x 1 1 1 à 
2) (1 А : 
fh | 1 : E ; 
41) (2) i : : 
(41) (12) i 2 ; 3 : 
(8) (3) i ; 
(4) (21) Б Š | | : 
4) (1° 1 " 3 : i : 
E 522074 i ; 
КІРІ : T i a 2 $ 1 4 Е s 
эз} : ғ 5 : Li . Ж: : . . 
32) (13) : : ; i ; 
na | i 3 p Я ; ; 
DG Ы 2 2 . 1 
31) (3) 1 : 1 i 1 
36), 1 à х : 
(31) (a $ i : : | : 
3 EN 1 Я 1 d : i ; 
3) (21?) і : 
2 1 - i 3 3 М 
E 1 i 2 2 Н 3 Е : i 
ah 1 1 1 s b 1 3 5 
(25) ED Е | . : 4 : 
m : 1 a о 
fada) i i : : | 
en а 1 ч . 1 $ 
SEIN Б : 1 4 я . 2 7 
IRE 21) ; ; j 
215) (13) : | 3 1 
a 1 3 
2) (1 : 
2n Et | 
fp | | кок О 2 
(5) (1): i ; : : : | 
ИИ à d 
ў үй 1 1 3 2 2 $ à 22 бе ae ; 
п, | т О геа а м 
о Ее. { 
3) ШУ, 1 i 2001 : p^ 2 s 
3 (1) 1 2 a) GU € 1) 
ау Gn Gm G 22% 
2 B 
3) (420 | : ; : 
““?б)у б) (б) (52) (51) G i т | 
и ии a 
ШШ : : 1 i 1 5$ : 504 x B 2 0556 7: 
› ol ; = 2 2 3 4 5 i4 М а 
38) $ 2 I : s 3 i à } Mee : 
К 1 : . 3 Е 4 $ ў Р 
2) (20) i 1 1 Я А Roe j b RI: i 
апуу à h 3 2 5 . 3 ; ч E 
e ШІ : Е 1 ; 6 10 Ы . 6 E. 20 | 
211) 15) (1) . 6 Б 4 5 E 6 24 зо : 
21): s 3 5 Е 4 4 2 20 % 4 : : 
i e 1 2 2 4 3 2 H 12 м : 3 Е E 
E s я т 2 : - 2 т . 1 2 3 2 25 105 
5 a H m > 1 а i 3 1 3 18 о 140 
294 (із) i 5 Е i$ 3 Me 
: ІЗ ч ^ . $ 2 12 31 
is m B š 1 5 12 а 
ч P Aet сч. a s v 
қ ; é a 5 t А 
so" ó 1 з d 5 2 м 9 s 6 ; 
31) (у 1 3 6 В 9 : 4 2 tog d E 
E 3 6 2 12 H Т 23 Е 8 P о 
NN Е 3 2 3 5 " 12 10 8 3 Ч 
(12 (rye 1 2 2 H 3 6 12 9 3 ó 
sm ы H 2 В 4 2 8 16 Е 6 6 20 z 
ЧҮ s 3 1 . 12 6 6 о: 
2) R) O 1 : 4 4 H 15 Е 4 H 20 "5 24 5 219 
(ау а) (1): 1 3 2 3 3 3 6 2 и - @ 1% зо 
aja 1 I 3 г 3 2 2 7 3 4 27 7 б 
3j (аз) (т) 3 2 2 Es 4 13 4 $ 7 . 
2 | x) 1 I x . 5 : A 5 27 2 à A s 
447! . 1 2 12 55 Е Да 2 
epa n * А 3 6 2 : до 
SUM " s 1 Н H : 
и 12 
3o 
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Monomial-monomial symmetric function tables 


Table 8 


(8) 


(71) 


(62) 


(61*) 


(53) (521) (зт?) 


(+) 


(431) 


(42°) 


(421?) 


(5) (21) 
5) (13) 


Am» 
Gn 


$1) 03 
(19) 09 
ay 


we 


ED 


IE 


... 


m 


"T 


| 
ЕЯ 
2 255 
РЧ 
* 


аа’ РАЙ 


coc QGOGGGGGGGG 


горел 
COGQG 


"€———————————— M—nm—À e РЕАК 
RR RS 
ты К 


(32) Gu) 


+ Rew 


ы 


ыы. 
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Table 8 (cont.) 


ы. 


„ж; TES 


(3271) 


* Bun 


pune: 


ко рк а 


Gr) GY 6) (ч) (Gu) (а) — 05 
| 
3 3 
5 i 
i É 
Р 5 
5 ч 
3 10 
3 10 . : 
Н 5 4 : 
1 à : 
à 1 % 2 
X 2 м . 
1 . 
: ы 3 4 : 
3 ; я 
1 i 5 à Р 
; 6 Р М 
; я f Y ; 
S : 8 B 3 
ы 2 Е : 
: 1 E Б 3 15 : 
1 > ж, 8 5 M 
1 5 E 12 a Е 
$ 70 d 2 а g 
6 к ë 3 2 n 
10 . е т 8 
ME : 
: i 6 28 
^ 15 56 
: 1 4 20 7 
1 2 
Biom. 46 
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Monomial-monomial symmetric function tables 


Table 8 (cont.) 


w = 8 (iii) [3] Gn (62) (бі) (53) (G2) (519) (43) аз) (4) 42”) 
(Су, 1 2 1 2 Я қ x 2 $ + . 
51)(1 " 1 2 $ A . . 
3 (40, 1 1 1 e i $ s Š á В . 
5) (12) (1) . 1 x 3 “ 1 3 В E а + 
2) (1) т 1 a 2 2 4 2 2 2 2 
15) (1)? а Е 1 r 2 6 $ 1 2 5 
5n (20) 1 1 2 1 1 В 2 2 2 2 
in 19(9 : 1 2 1 4 9 2 1 2 5 
4) (3) (1) 1 H 5 2 1 $ Е 2 т В : 
(4) (21) (1) 1 2 2 1 2 d 1 2 2 
(4) (2)? Р 
i 23 қ i А i i i : 2 1 г 1 
4 (1) (т : : 1 А 1 , : ; 1 
CI i 2 р н 3 : н i 2 
321) (1)? à р 5 i қ * 2 М 4 
G2) (20) i 3 1 i Ч Z 5 2 x 
32) (1°) (1) е à 1 1 М 2 2 3 
КУ) ; : d Ж 2 
31°) (2) (1) 1 1 3 з 2 a r 
31°) (1°) (1) . 5 š 
e» SI Е H i 5 g x : 5 i > E 
geen) | : Е 1 2 2 4 6 а P 4 4 
31) (2) (тї 2 2 : : 3 2 ч Е H : : 
б (19) B H ы р » М 1 3 2 3 2 2 
31) 19 Е В 5 Н i а 8 i : 4 19 
бі 9, Е : 1 : 2 2 Е е E С 
3) (24) (1) : Б 1 E 1 i b 3 i 
(3) (212 
G fene) { ; : ; 2 3 ; А 2 3 
(3 E 1?) м 3 s : 2 . 2 1 2 2 
8 211 А E а 3 . 2 2 3 
3) (14 tr е 5 5 1 * 1 
(3 (15 12) . . 1 . „ 3 
20 3 1 3 B т 
АЧУ, TEM = i d 
“уа я : А 
2] 0 1 1 3 2 
. 2 
{а 220 * . 1 1 : 2 
Си 3e : . . x A 5 H 1 
96 | ш ко Pl | 
ате ы . 
21°, "m % . 5 А 7 ? > 
21?) (1°) (1) m ы 1 x . 5 
2n 21) (1) = . . М з H 
213) (2)* 1 1 3 3 4 5 
(а (2) у . ё B = ` 
rie б йк 1 3 i © Pod 
rn с б $3 à jd iP jd 
Ir Ы “ 1 
2: (52 i 1 6 + 5 Р 2 
e (13) (1°) . > Е х P x 
2) (1%) à i 2 5 
@ 090) i t r , 
gae er 
ate MEE: 
СЕ 9 Б 
109 (1з) 
(уа? 
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te = 8 (iv) w5 бш ÇH м Gu Gy ӨМ Gu) Quy (80 ШШ 
6) (1) и | 
51) (1): : 
5) (2) (1) Е 
5) (12) (1) б 
o. ; 
1 т 
ТЕН 
9009 Е 
4) (21) (1) B 
4) (2)? % . 
DR К ; : А : : 
4) ie) Н * м Д 5 3 
ЖА |t 0 Pod +4: 
т 6 e 
H (2) 1) : Я 3 2 2 4 
32) (15) (1) а 2 6 2 3 = 
315) (1): 8 2 2 7 39 
31) (2) (1) = 2 6 2 3 ` 
ЖАН 18 2 4 5 13 Не 
HON : ê $ ё 6 
31) (2) (1: 6 2 2 о 
за) | в > g EP 4 & 
BER |: 3! 4 3 tg 
1 
3) (25) (1) 2 р 1 
о 2 * P $ I Іі 3 
3 21) (12) а 2 3 2 3 2 А : 
3 B I 
МІРІ fci P5 13 
. в A o 4 
amm d i i: 44 % 
211) (туз . 2 5 5 12 р 2 
211) (9) H 2 M 15 ы 
ач) у 2 6 3 3 » 6 
эм 
21) (2) (12) 2 3 : 3 4 В А о 
Жр: о: ENT ы 
ан : á % 4 3 10 4 6 E. 60 
мініс із 4 9 8 NM D 
2n) 130) 4 : а Я 15 Н Е B - 
213) 21) Q) 12 6 10 ү! . Е 12 о 
б а 5 12 Hn be 
Е m 
pu RAN 6 2 6 6 8 е m 8 5 " 
мы MIC 6 3 i H 30 о 24 12 Я Б 
aie | 2 { ЖЕ T 
21) (12) : 2 18 * x К 6 18 [ 
Tels * 1 1 j d e OS 
ay me 6) 4 2: 2 22 50 E s 2 6 
2) (14 1°) 12 2 6 2 8 6 
2 оу 1 . a a 5 Ы 5 Н: 
)Q) i . E * 4 12 20 $6 
30300 | a. 4 В 2” 
уп i a 1 3 14 5 280 
19) (туз . : = р 5 п DNE че 
ыы: L4 42 222959 
т) (15 a " . 1 3 20 31 р 
14 Gs > 3 2 1 39 m 
154 (11) I 2 5 de 
1 
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Table 8 (cont.) 


w = 8 (у) | (8) (71) (62) 
=| 
аҙ | 1 3 3 6 
41) (1)? = 1 3 6 
% (2) (1) | 1 2 2 2 
4) (1?) Q* | А 1 2 5 
n (1) | . Ў H . 
буша | i 2 4 
2 . 1 
а | К | 1 ч 
(3) (0* 1 2 1 2 
(3) (21) (1)* . 1 3 4 
3) (2)? (1) 1 1 2 a 
3) (2) (1°) (1) H 1 3 
б) МЕЛІ : i Н 
(221) (т)? қ : 
(22) (2) (1): i : 
Gt à 5 
ӨЗІНІ қ 
21%) (2) (1) 1 
215 19 s 
21) (1)? = P 1 2 
2) H P 1 2 2 
21) (2) (11)  : > i 2 
3) (1 
Не | | | 
z : + à 
Bn jo : қ р 1 
Bay У а 
(еме : š а à 
Be]: : : !: 
13)? {97 $ * . . 
jo o è & к , 
‹ р t ` . 
ee, |; i l g 
(3) (2) 0) i 3 4 8 
a (1 M x Ж " 
200 қ А 1 : 
ч . 
ЫЙ (t % à а 1 © 
[HON i % 4 2 
(2)* (15) (1)? . 1 2 
3 02. 0 54 
СШ ME 
р i lo: 
за 
(21) (1)* З 3 % do 
eu 1 4 8 e 
2) (1*) (1)* 1 4 9 
(12) (1) | З н 1 
Шоу аа 
ШЕ 
(12) (1)* | 1 Ж E 
а» | E 8 28 56 


(617) 


(53) (521) 
1 3 
3 9 
2 2 
I 5 
4 3 
А 3 
3 + 
2 7 
2 М 
3 6 
3 2 
1 2 
i 5 
4 1 
2 2 
1 1 

3 

1 
4 8 
3 4 
2 5 
B т 
1 3 
3 3 
H 1 
. 1 
1 3 
à 2 
7 18 
5 6 
3 12 
E 4 
= 5 
q 14 
4 11 
6 6 
3 7 
s 3 
2 8 
. H 
А 3 

11 30 

15 36 

12 20 
9 23 
2 5 

14 

26 66 

15 51 

56 168 


(51?) (+) 
6 А 
18 2 
12 M 
$ 6 
9 : 
6 à 
15 2 
à 4 
6 2 
3 2 
7 B 
12 а 
: 2 
S 2 
1 я 
6 
3 
12 6 
è 4 
9 2 
3 . 
6 2 
. 6 
4 я 
6 2 
1 . 
3 + 
6 " 
я i 
24 2 
36 8 
6 6 
27 2 
. 6 
12 М 
18 8 
21 6 
. 6 
12 4 
10 5 
18 2 
1 . 
3 . 
6 2 
бо 10 
бо 20 
24 14 
48 8 
16 Е 
зо 6 
120 зо 
108 20 
336 7° 


cane: 


edd 


pU 


хо чм 


m 


0 12 
2 5 
6 6 
6 15 
4 4 
10 22 
6 6 
2 3 
2 7 
12 
=: g 
5 2 
4 5 
5 4 
6 11 
7 14 
20 24 
10 6 
8 12 
12 
i 24 
12 : 
a 3 
“ 2 
5 ° 
8 
è 15 
: 1 
Я 2 
2 5 
6 12 
6 12 
2 48 
24 P 
12 27 
1 12 
20 39 
26 39 
32 бо 
12 
10 15 
6 % 
36 
17 3 
18 
4 39 
30 60 
o 150 
№ 
44 87 
20 55 
53 д 
жс 210 
150 315 
420 840 
Же ы; 
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Table 8 (cont.) 


w= 8 (vi) 
(41) (3:2) ө 
225 2 (31) (баз) (321° 
32 521) — (319) (2* 5 
о» — = ) (әзі) — (21) (219) a5 
(4) (2) 24 А ^ 5 
02 G0) И Ы . 
Өз) аҙ 12 $ . ; 
15) (1)? $ = Я 
Ша P$ 4 { 5 
B 2 2 
Gy 9 (1): 8 8 ie 6 % бо 
(3) (21) (1) 2 E: 12 27 60 
(3) (2): Е 8 6 ё 3 
(3) (2 ne ( 
ео 4 à 2 : . 
(3) (12) (1) 20 5 2 2 3 > > 
E ay 3o чү > 2 7 20 К ді 
Ga (2) (1): is 4 5 12 30 Ы Е 
2°) (12) (1): Е 12 22 18 Е, 
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Approximate linearization of the incomplete /-function 


By A. W. KIMBALL Ахр E. LEACH 
Oak Ridge National Laboratory, T'ennesseet 


1. INTRODUCTION 


The incomplete #-function has been a favourite subject with statisticians for many years because it 
arises frequently in our work and because it presents problems in evaluation. Several tables of m 
function and related functions have been constructed and many approximations have been derived. 
Wise (1950) gives an excellent account of various approximations in terms of percentage points of the 
x?-distribution and provides methods for computing percentage errors. From his paper, and ame 
literature which is too voluminous to review here, one can ascertain that most efforts have been directe 
toward the numerical evaluation of either the function itself or of one of its arguments, usually n 
connexion with the determination of Type I or Type II errors for test statistics, the construction o 
confidence intervals, or the determination of sample sizes. When one attempts to use existing formulae 
and tables, one is immediately impressed by the fact that no single table or formula is sufficient for every 
purpose. Tables may be unsatisfactory either because the range of the arguments is too narrow or hea’ 
interpolation is required, The approximation formulae may be unsatisfactory because they are derive! 
for significance tests or confidence intervals and hence are restricted to percentage points usually near 
the tails of the distribution. 

ч The approximation reported in this paper resulted from the need for a single representation of the 
incomplete #-function that would be valid over wide ranges of the function and the three arguments. 
radiation mortality studies, an approach known popularly as ‘target theory’ predicts that, for som 
organisms, the probability (М) of death from a dose (D) of radiation is given by 


Dtq-l 
Ме X Op'óa(]-au)eee1- = I (p, д), 
і-р 


where p and 4 are unknown parameters, « is a function of D, and L,(p, д) is Pearson's (1934) notation a 
the incomplete beta function, i.e. 


L(p.q) = [aoe ajf t?7X(1— tc qt, 
0 


From observations on M and D, regression estimates may be obtained, although the use of 1.04 
expected value presents formidable difficulties. To avoid these difficulties, various attempts were mad 
to find a transformation that would provide an approximate linear relationshi between functions of the 
dependent and independent variables. To be of any value the approximation had to be satisfactory gee 
almost the entire range of I,(p,q) and had to be valid for values of p, q which were unknown, and for 
values of z which varied over an unknown range. A search of the literature revealed that, for reasons 
already cited, no available single formula would suffice. However by combini ; hs formulae found? 

the literature and by making one modification, we arrived at ^ linear T анан that prove 

adequate for the regression problem. Although it is less accurate than some approximations for 0 


ose of evaluating I,(p, q), it h: i А fitting 
Mie rm g L.(p, q), it has obvious advantages for use as an expected value in the curve fit 


2. THE APPROXIMATION 


In connexion with a sample size problem, Scheffé & Tuke 1944) gi А ion ti 
equation 1,(р,4) = а, У (1944) givo, as an. approximate solutio 


w+1-D) = 438 (12) «iun, à 


2: о, В ЕР " ы г 
where X2 is the 1002 % point of the y?-distribution with 24 degrees of indicate that fo 
0-005 < о < 0:10 and х > 0-9, the error in this solution is MEUM, S еу n atl foe large" 

25 


T Operated by Union Carbide Corporation for the 17.8. Atomic Energy Commission. 
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values of а? s T ег 

s but becomes in i 

Sona s inaccurate as x is decreased. We have found that the approximation can be 
extended to smaller values of z if the first term is multiplied by 


(B+ ME E 
— 33*19s 3^ — ) : (2) 
© following ; В 
wing modification of the Scheffé-Tukey approximation has been obtained by Tukey (1958) 
А? = 240 — у) + (2p 0) 12 9. (3) 


Table 1 show that this new form is remarkably superior to 
oes not permit linearization of the regression problem. 


imation to x2, 
1 гү 

= T — 

Ха = 24 (i-r is) , (4) 


ш-а 1 
r TES е- dt = а. 
em. IS) 


Com: i 
4; cape over the range of arguments in 
Wilson & approximation. Unfortunately, it d 
& Hilferty (1931) give the following approx! 


Where А 
11-4 is defined b 

y 

n-Hilferty approximation, when com- 


the Wilso 
e was chosen for use in 


Accordi 
Pared = уд the results of Goldberg & Levine (1946), 
he three others, does best over a wider range of arguments and henc! 


о 
МЕ s ion problem 
e ini à 
ombining (4) with (1) as modified by (2), one obtains 
Where 2p-1]i 
дебісі жә E , 


. ja 21. d L9] |. 
x [3 rae 


hus, f, 1+2, 

"гапу з, p and а, Ya can be computed readily and the approximation, 
i Ya 1 E " 

- - eM dt, ( 

1-1(рФ = [^ie 

e advantage of this approxim 

he problem linear, 

ting aids for pr 


ation in the afore- 
but the dependent 


Sbtaj 
ned ғ 
rom tables of the normal distribution. Th 
obit analysis. 


entio, $ 
ne Not only is t 


Vari d regression ious. No 
able pli problem should be obvio! abounds with compu 


18 five is a probit, and the literature 


y Combin; 
ining (3) with (4), one obtains т 
2p-1[1 25] (7) 
Е] | 
т ET %4 үл "A | 
ing I, this form is 
EM 1, the approximation obtained from (7) is identified as 1%, For evaluating L(p,d) 

n Tal Superior to (5), but it is of little use in the regression problem. ptained from Pearson’s (1934) 
tables a le 1, threo approximations are compared with exact values i7 р? erah 
T " e from the Harvard tables (1955). The range covered is =" t T ptain the exact values and so that 

ie S Of a chi i "jon would be require 9 f x are not shown. 
“Жока лш 
ing, Без of r 0-05, 0-30, 0-7 2an 951 for s ected to do 
НТ O 
е and q, the dir А kin 
1, Table 1 also к approximation obtained by (84 d (8) 
"m m: 
abl det (pra jaime 
e1 aut 
> the approximation obtained from (8) 18 identified а 
=MATIONS 
ee 3, боормивз oF THE ATP nM cn 41 өйбөр 
Whey, On OF T ‚ог than J’ for ve one-half for the binomial 
distr Рава able 1 shows that / does generally bette m a probability near as fo atively smal 
ibu,; _ ате about equal. This latter case en vides 6004 accuracy 


lon i 
inen 
Which ease the normal approxim® 10 
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Table 14. Approximations to I,(p, 4) 
I = exact value Г’ = approximation from (6) and ($) 
1 = approximation from (5) and (6) 1* = approximation from (6) and (7) 
(Numbers in parentheses are errors to the nearest third decimal.) 
| | 
p q Tı 2% ty Ya 
2 ШЕ: 0-04840 0-30250 0-70560 0.94090 
f :05109 (+ 3) -30871 (+ 6) 71294 (+ 7) 93178 (— 9) 
p -03519 (— 13) -28483 (— 18) :63577 (— 70) -96591 (+25) 
1* "04724 (— 1) -31094 (+ 8) 1290 (+ 7) | (93149 (— 9) 
| 
2| 2 |I | 0.05331 | 0.29549 0-70451 | 0-946600 
1 "04478 (— 8) :28806 (— 7) "70235 (— 2) 94316 (— 4) 
2d ‘03617 (—17) -30672 (+11) -69328 (— 11) -96383 (+ 17) 
I* -05120 (— 2) -30762 (+ 12) 71137 (+ 7) -94360 (— 3) 
| 
5 2 4 0-05097 0-30065 0.70441 0.95408 
1 "05247 (+ 2) -30372 (+ 3) “70941 (+ 5) -95176 (— 2) 
Г "05074 ( 0) -28728 (— 13) -67231 (— 32) 97499 (+21) 
T :04987 (— 1) -30253 (+ 9) -70829 (+ 4) :95134 (= 3) 
5 4 | 1 0.05054 0-29985 0:70640 0.95244 
1 04715 (— 3) | -29179(— 8) | 410221 (— 4) 95055 (- 2) 
E 04470 (— 6) :30007( 0) :69808 (— 8) :96282 (+10) 
:04887 (— 2) :30346 (+ 4) -70939 (+ 3) 95113 (= D 
12 211 004731 0-29205 0-72063 0-94363 е 
1 "04884 (+ 2) -29602 (+ 4) 12649 (+ 6) 94207 (— 2) 
Те 05688 (+10) 27212 (—20) :68092 (— 40) -96444 (+21) 
:04661( 0) -29187 ( 0) -72385 (+ 3) -04142 (— 2 
12 6 f Miner 0-29231 0-70109 0-94521 
T Een: 0) -29274 ( 0) -70258 (+ 2) -94505 ( 0) 
Т* mn 1) "28709 (— 5) -69045 (— 11) 95403 (+ x 
(= 1) | 29812 (4 1) | -70265(4 2) | -94469( 
12 | 0 | I à 
1 “ақ 0:29888 0-69144 094519 . 
r | ossa р) | 228307015) | -67950(-1ә) | -94193(— a 
T bro m 2) -29868 ( 0) -68792 (— 4) -94949 (+ n 
=D | 30057 (+ 2) 69304 (+ 2) 94450 (— 
30 8'| д 0-04909 ; 
T 05149 (+ 2) 0:29168 0.68274 0.94936 1) 
" 05657 (+ 8) :29782 (+ 6) -68925 (+ 6) -95013 (+ 3i 
T* 04878 (0) 27948 (- 12) -66150 (— 21) :96217 (+1 
29138( 0) :68408 (+ 1) -94894 ( 
30 I5 | Jr 0-05268 s 
1 ea 0-29271 0-723492 0-95542 
(+ 1) `29295( 0 з 5540( 9 
T 05413 (+ 1) 280 ) 72409 (+ 1) -955 5) 
I* 05997 (0) ез. ый! "1840 (— 5) -96030 (+ 0) 
9312 ( 0) "72409 (+ 1) -95518 ( 
30 | 25 | I 0-04282 ? 
? pes 0-29726 0-69384 0-94300 
3596 (— 7) -27380 (— 94 | o(- 9 
I -04916 (— ) 67286 (—21) :9369 
(— 1) :29683( 0) 5(+ 2) 
I* 04215 (— 1) 99810 69210 (— 2) -9449 0) 
(+ 1) :69469 (+ 1) -94255 ( 


T Abridged from a larger table available from 
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Table 1 (cont.) 
p q 
L ү А 23 x, 
4 
80 pes 
11 1 
0-26911 0-71251 
pos 69 25 0-95032 
4 quse (s 4) -27806 (+ 9) -72087 (+ 8) -95220 (+ 2) 
| 5. n (+ 6) -26175 (— 7) -70092 (—12) 95864 (+ 8) 
! E -05277( 0) -26882( 0) 1307 (+1) | -95021( 0) 
41 Е 
1 0-05198 0-30621 0-66294 0-94966 
5 05184 ( 0) 30574 ( 0) -66270( 0) 94957 ( 0) 
j^ 05318 (+ 1) -30380 (— 2) -69545 (+32) -95210 (+ 2) 
: 03176 ( 0) -30644( 0) .66336( 0) -94949( 0) 
ШЕ 
71 
! 4 0-04292 0-31229 0-68943 0.95914 
" -03102 (— 12) -36657 (— 46) -64480 (— 45) -94845 (— 11) 
5 -04270( 0) 31207 ( 0) -68877 (— 1) -95994 (4- 1) 
ба :04950( 0) -31286 (+ 1) 69000 (+ 1) -95881( 0) 
9 41 
Ў 0-05543 0-33028 0-65056 0.95911 
4 :06088 (+ 6) 34758 (+17) 66684 (+16) 96240 (+ 3) 
1 05833 (+ 3) | -32000(— 5) | 204480 96987 (+ 4) 
А 1% .05536( 0) .33029 ( 0) :65079( 0) -95909( 0) 
0 
121 1 0-05711 0.31223 0.73254 0-96066 
1 05237 (— 5) | 29822(-1 +102? (—12) | -96763(— D 
f 05770 (+ 1) | 31107 (7 D gat (— Ш | ПЕЛ) 
90 ue -05694 ( 0) .31284( 0) 32:4( 0) .96051( 0) 
0 
18 7 
1 |2 0-03961 027940 мы o (— 0 
a -02318 (— 16) .30972 (- 70) daa |= cM 0) 
4B 2 zoot 0 | TAS o | чю 0) 
- -03935( 0) .27969( 0) 72184 ( 
101 й 0-07166 
z 002152 021393 —— боШ C .97562 (+ 4 
Ы -02607 (+ 5) -23660 (+28) .69186 (— 3) -97389 (+ 2) 
„| о amal D | 565880 0) | Өле 9 
- I -02150( 0) -21391 ( 
- Е 0-93511 
A 0-02364 0.20249 —— eT 92779 (- 7) 
"0209 (— 3) | 18560 (71 | .6212(- D 98569 (+ 1) 
ï .02403( 0) -20225 ( ‘e3030 9 :93507( 0) 
- I* 09357 ( 0) .90236( 0) 
а ‚60280 0-93303 
1 | 0.05502 о nn oea E 
1 02576 (—30) 24218 БД ЕКТІ :98319( 0) 
1 05504( 0) 136048 ( d “60313 ( .93294( 0) 
| I* .05578( 0) -36096 ( 
tho author on request 
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sample sizes. On the other hand, the approximation given by fis not too bad for these cases, Lec 
for purposes of regression, and its accuracy tends to improve as x increases. As E aer dg 
q > 41, the normal approximation is — d benter than Г and in fact is good enough for a 

i e including the regression problem. P 
на 2% is Ashes: буи A inet over the ranges examined and even opor ee 
fully with J’ for large p and 4. Although 1 does not have the uniform accuracy of 1%, or of I таня 
p and 4, it is definitely preferable for use in the regression problem in the lower ranges of p and q. : es 
itshould be emphasized that all of these approximations may be used, without the need for serae д. 
for integral or fractional values of p and q, the only tables required being the readily available table: 
the normal distribution. 
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Sequential occupancy 


By D. E. BARTON anp Е. М. DAVID 
University College London 


INTRODUCTION 


Р of 
In previous papers (Barton & David, 1959a, b) we have discussed the distribution of the number 


м TUR 
empty compartments when n balls are randomly and independently thrown into N identical comp? 
ments. Various restrictions on the compartm. 


the number of empty compartments is fixed а, 
without interest. The classical case corres 
course, well known, but we think that this i 

Тһе actual distributions are simply and easily obt 
distribution of n, the number of balls required to fi 


2 s let the 
ained. Suppose Ё zero compartments wm the 


the 


4 u 
s о , the sequential distributions thus follow from 0 
. 8 Scheme has also been noted previously- 
Sy to find unless the actual probability PT ó 
е discuss here methods of obtaining these mom 


ocess 19 


CLASSICAL CASE WITH NO RESTRAINT ON B 
Тһе probability distribution of n is 


1 5-1 1 
p(n) = Желі Оқа А®/-®-їрп-1 T€ Neu, AN-k-192-1 (n 2 N—k,N —k4 1, 90 

„2 В 1818 
а fact which is well known. The moments of n are Straightforwa; d. $ ing of the bal s 
going on and we break in at the point when the (N—t)th b en oe Pposs tks са ball Jeaving 


OX, Е 
< k) boxes empty. Let there be n, balls in the (N —1) box empty before, receives on 


ALLS OR COMPARTMENTS | 
e 
es. Let the further number of balls ӨШІР | 
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up to and i 

ncluding th à 
number of } g the occasion when one furth i 
balls necessary to achieve N — k м x receives one ball be ау... Then the total 
а y is 


N 
у в = s 
Misa e) ar s 2, N-t 
achi -Qy is pie aul de | 
Chieved, Aecordingiy sequential binomial variable with the trials stopping when just one success is 
у | ja 
Кы(ау-д-(ғ- 1)! (&) xd ич) 
D 
with $ 
КЕСЕР 
or Bays oy 
Е №" 
Кы(ау-д-(ғ- 1)! (Ez) ses и) à 


In thi 
8 classica] 
Scheme the a’s are mutually independent and consequently 
_ м [N-i 
Күз) = Вы > 22) ). 
ге t! 
Кут) = ( » Lj. 
Kg) = (r— DI (Qr- 5) - 077 ( 7 J) 
) ( W-D 25 W- 


(a fo, 
rm whi 
ch 
B leads more easily to asymptotic expression). 
classical sequential set-up discussed in the 


t 3 
Pr “the distri s 
есейіп pons шн and the factorial eumulants of the 
graph are well known. It is known also that for k not too large 
s k] 
" к N е (т 51 
Whe Kun) = № тр Kiln) = № (т 2 1). 
«A 2 
E= 0 tho n—logN 
teng cumulants of = g 
With i т- N 
increasing N to 
© 1 
Thi K, = y(Euler’s constant), К, = x я ("> 2). 
ІН impli j= 
Plies 5 
that in the limit K.- _2 "log T(z) 
К д2 г-1 
ility distribution function is 
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nts are the cumulants of a variable = whose cumulative probab 
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th 
erefore is the limiting c.p.D.r. 
ple size, classical type occupancy 


he 
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It follows that 


1 m "o ])m+i+1 1 
Kim = = (reg?) «pene a, “ора 1--а 


1 
and К, = ш: 


1\ pN 
K, $ -N (от +—. 
4 4 


As in the classical type problem this implies that 


— ^-Nlog(ljg | 
VN{(n/q) — log (1/q)}! 


is, asymptotically, a unit normal variable as № — оо. 


RoxaNovskv's CASE WITH RESTRICTED COMPARTMENT CAPACITY 


In this case, as we have described i 
capacity of each compartment is r 
ments, to be left empty, 


- at the 
n our previous paper (Barton & David, 19594), it is supposed nares 
estricted to r balls. The probability distribution of n is, for k c 


] No N-k-1={(n-1)/r} 
pin) = TEL А0, 


Nr—n41 Nr 2 (es РУ а електе. T 
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a distribution which we believe is new, 
to those for the classical Sequential case в 
tof the cells have been occupied, by SS, b; 
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кыл identical, line 
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О far considered. Suppose weenter the process at the po cupation 
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It follows, still conditional on S, that 
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ll expectation we have 
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Reduction of this expectation gives us for the factorial advancing moment of S, 
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APPARENT CONTAG 
ARENT CONTAGION WITHIN A COMPARTMENT; А GENERALIZED PÓLYA SCHEME 


Followin р 
thes (k zh Ned nire paper (1959«) ifa is a measure of the contagion within a cell then the probability 
cells out of N will be left empty when (n — 1) balls are dropped randomly is 


NC, МК m" : | 
aNen-2¢ EQ n IO ау; 
PR si j=0 
re Е = ili 
remarked there, this is a generalization of Pólya's scheme. The probability that one further ball 


TOPs in а 
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ў ч. ing i own indepe 
n cupied the balls falling in them. The balls are thr 
random variable 
For thi (i) n fixed and k, the number of zeros, a тат ы ӨК 
= hon-sequential case, we define а set of random varial ә 
; fter n balls are dropped. 


1 if the ith box is empty ? 


a= 
Then -0 ifnot. 
the nu 
mber of empty boxes is N 
= gi 
ang Р à | y 
penso P 
d (ke) = m! 2 (32-55 и 
ы <<<" 
ith = ть (зау) (M> 1) 
Th 
e Probabili st - 0. 
ity 8enerating function of k is т 
N-1 m gi) 
„—1)"®т 
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(ii) k fixed and n a random variable 
Let P(n) denote the cumulative probability distribution function of n. We have 


Рип) = probability of at most k zeros after n throws 
= coefficient of z* in z(z)/(1— z). 


Write п*(у) = Ep,(n) у", 


where p;(n) is the P.D.F. of n. We have 


* 
EP,(n)y" = т) 


1-у 
| 1 ] Na 
— coefficient of z* in = f = yt (=— nA) , 
1 
where Rly) = У -------------, 
4<4,<..<4Һ1-(1-р,-..-?і,)У 
Tt follows that m*(y)—1 N-1 
ү = coefficient of zt т У (z— 1)m- R, (y). 
y-1 т=1 
If further we put y = 1+0, T*(y)—1 NES AW gp 


y-1 a1 VI 
and therefore y nin 


N-1 V-1 
Am = coefficient of z* in У (@—1)"-1 % (Pa t + Pig)? " 
m-1 


l hankin (0+. 0)" 


Two APPLICATIONS OF THE PREVIOUS SECTION 
(i) Assume a line of unit length 
Ут is the sum of m of these, then 


. Af 
is broken, divided randomly into N pieces of lengths ру, P»: so PN 
(N-1)! 
a ee | Жем тт a 
P(Ym) mI Wma” 1(1-у)М-т-і, А 
pe! 
uu еш v Pa.» Py are now the box probabilities then the Vth factorial moment ofn, the pug 
quired to occupy М — k boxes is given by j/V! which is the coefficient of z* in 


N 
(905) Ў (ag l 
m=1 


ty... ti 
тм 


0 x 
which may be shown to tend to {(47/,/3) — 6}. Thus 


4т 
Ду > Ng, (®- 
1 WE 6). 
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uch larger than that for the classical sequential Жы 
E а к occupancy scheme, where if 
іл NlogN. 


Ever si 
Since 
(De Mo; ce the early 
ture Е у paper of Stev 7 
mm m) case Kea за епв (1937) the distribution of the numb 
Counts = xample, where fun ӨЛ! Tonne Rac жене | mim d i ы ы Merk m 
Sequenti no overl z & Eadie (1957) di istri ор о 
e 2 виріз Е К зсиззе the distributi i ee 
Cigarette "er upancy dint е 246 ыы permitted. So far we dem істі совин 
Proble Bs There 3 s other than in slightly метке Ya gens 
oben Е ightly unreal probl: EM 
(ii) m ог exam oubt, however, th i анмен а те 
” ап fs, denn , that sequential distributi implicit i неу 
оҒа gregari conditioned by —€——Ó be traps and the irs al FPE: чон 
Cases we ier spirit among the her there is room in the traps, or (iii) = 7 hs eem a ната 
um азе боны ores НСО: These three conditions will pan. —Ó B катане 
У worth noting th у, the classical, the Romanovsky and the аы ее ens 
hat the sequential aspect of the occupancy аста = dde 
m may be considered as the 


Numb 
er of 
renewals i 
5 in a special form of renewal process 
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The 
generali 
zed mean differences of the binomial and Poisson distributions 


By T. A. RAMASUBBAN 
London School of Economics and Political Science 


1. INTRODUCTION 


В mea; 
n diffe 
rence g, defined by 
(11) 


ersion. Unlike most of the quantit 
dent of any се ion. But, 
n, the study 0 
dvanced quite as mu 
follow & normal law. 
ven by Nair (1936) by arather 
Jer derivation. Kamat (1953) 
of skewness fi 


Wing 2 m 

n еа 9 

Stap, 9 tO surin, 

АТИ к. За mather, the amount of scatter, 9 is in 
Other nd in parti atical complexities introduce! 
Fort ж е — some of its sampling prop 

eva, Plica, ied miil БӨР except perhaps when the variates 

Con luateg рр Method. neat the exact standard error of gl 

(19) C tureq e third 47. Lomnicki (1952) obtained the sam 

6) that for ] nent in an exact form and, from the considerat? 
of mat arge n the distribution of g шау be ollowing Kamat, 
* the ге, ® values for i И 
dig т, taken + or its fourth momen! 
ogether with the values of f, 


The "on 

Un, ie Nor, ie for втаПва 1 ls ant h I prop 

N $ mples ult: іс! r 

d ples (n < 10), theres mial and Poisson distr 


iso," Certajn tributi . 
Fia diat zi О can be looked upon as à limiting case of the bino 
ons. It is therefore natural to investigate the sampling 1101 r th 

riate distribution for g even if the exact distribu- 


ab, is ributi 
вор 0t utions 
ute Casi] and to attempt fitti 
me. UY founi pt fitting an арргор i i i 
косе ы ee SIE uri, eae 8 ae 
‚ given by 


e 
m to establish the closeness 
ted to obtain empirical 


trib iy DProximggi 
on, at least for sample size 
(1:2) 


А. = ХХ ji-j pOr) 


ij+i 
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i iscrete distribution 
for a few of the values of r, where i and j take all the integral values permitted by the discrete dist эй, 
p(t). In a recent article (1958), I have succeeded in obtaining values of А, for some of the disc una 
distributions; in particular, those for the binomial and the Poisson distributions are shown to be: 


—1 SME " n=l e n=l TTE Y 1:3) 
ланы ж [3 Г ) d у (5 Wa” T " | 
ї=0 i- 
1-4) 
and A,(Poisson) = 2Ae-?4[I,(22) + 7,(2A)], ( 


. shall 
where Z,(x) is the nth order modified Bessel function of the first kind. In the present раро [^ Бі 
evaluate for these two distributions the generalized expressions for A, valid for integral values 


2. ТнЕ EVEN ORDER MEAN DIFFERENCES 
By definition, the even order mean difference is given by 


quaes "€ (21) 
A, = EE |р pj) = DE (i= J)" pli) pls). 
ij ij 
Let £ be the arithmetic mean and дк be the kth moment about £. Then, since д; = 0, 
В А T 2r 2r 2r „+ 
А» = 00-0-0000 Р) = ль + (T neta (5 ) tt a tst tn 
43 2 3 2r—2 
2r-2 2r (2-2) 
= flor + x (-1) ү ) Даг. 

А | 6 satribu* 
Itis thus possible to express the mean difference of even order in terms of the moments of the dist? 
tion. In particular, 

А» = Эр, (2-3) 
A, = 2(u, +343), 


Ag = fig + 1515410 — 10/3), and во оп in 
H n i i i ns 
a substituting the various values for the moments of the binomial and the Poisson distributio 
:2) or (2-3) 


» the corresponding even order mean differences can be evaluated. 


3. THE ODD ORDER MEAN DIFFEREN 


р sting 
The evaluation of the odd order mean differences for the binomial distribution calls for some intere 
algebraic manipulations. We have 


CES FOR THE BINOMIAL DISTRIBUTION 


n n (31) 
А = 4—3|2r41 TA like ign-ipi (7-і 
ЕРЛЕРГЕ 

and since the sum of (i—j)2"#1 over the distribution vanishes, (3-1) becomes 
n i 2) 
A = вена (|. ` e 
2r+1 PP jy 8D) ЕШ pq 
=2 » шетен E ка. n 
775 i) н (E, i-o () p] 
2 Ў () ign (в 41 (б) n n B? 
-2 ‚| pq^-i Lir q"4- (i— 1a — di ee]. 
42,44 0 ) pp PEt] e i 
Expanding the in (3- i Қ ORT 
e Е the sum in (3:3) and rearranging the terms to form a finite series in ptg?^-*(s = 1,2. 
n—l(i т 
Nery, = 2 [ : {> (27+ 1r ( ) ( és рат 1 
і-0 9-0 2-7) M jl q 
n-l(i $ (3+ 
+7 (у (yrs ” т рз" |: 
| іі (5=1 i—j) Vi 
Тһе method of evaluation of the sums 


T— г" n an 
ES oe һе bera) "€ Re м )( + ) 


i—j] Mj 
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is Somewhat inv 

i à volved 

impair its clarity. ed and lengthy. I shall therefore omit such of the algebraical details which è 
3 may no 


Consider the sum X a ( : )( а ) 


ge 1—3) 44-7, 
Let i-k 1 
Bac $ Fate Ne. qe 
ігім-2-8/ Ve 
and i-k 
a —k \ [n-k 
Cui > yen" 2+k рі 
A” ) i-j-x) Vig +k = D! (S4), 


where th 
e ope = П 
expanded ean vers € F (8/4). The sum required is therefore Co, з |1=1 = D? S,|,.,; and since the 
So has DSo( = со) as a factor, it is more convenient to operate 2r times on сүү 


ала 
then set 7 = 1. Now i 
i n n 
Ca 35). | ee Las | 
" жел) (3-5) 


Splitti 
Dg 2j as ((i-. 5) (с Р 
J as (02467) — (i — j)), using Vandermonde's theorem, and finally summing over j, (3:5) gives 


= ,[n— 1 [(n71 lifmn-19)[n-1 
ЕЕЕ 


1 


i n—1| [n—1 
Again, суа =” 5 zi 


о 2 
оак | = Derg E n-IV № ous і-1/ п-1 1) (= 
olei = ( ; ) (-\)>”®+® i-j-1 ET Я E 
(9-7) 
ons by D on (#— 1-1) vanishes after ¢ is put equal to 
Therefore, expanding the term 


Tt can read 
]ds a factor 2. 


tty ang ily be seen that any even number of operati 
Der d that а ч à 

e~e nm пу odd number of operations y1e 

} оп the extreme right of (3:7), by Leibniz's t 


2 ?r | prale 
D*{(t— ¢-1) шыу, = 2 LG) pr (9 pe ecl (2 ) D ы), "j, 


d letting t = 1, we have 


On 4 
Putting t= 1; 
Б = Lin (3:6), wo have the well-known result 


(3-7 22 ( = Jerem» ps 
) can th piel 2p,-1 
ereforo be written as | 
с, o [n7 1| [n7 P) ou 2n X ( d ) еее t-1 E" 
Now 02-ы жа т; ie z oe 20— 
і-1 n—1 a | nas 
DS, = ; к. ee 
Plitting (9; м E uen ; x m ) (s 
(7-1) as (24.7) —(i—j— О} and summing as above 1? find, 
j ict n-2 pei es] (310) 
ыы зе Cae EC 
| Stati үлы © how that 
| Mg on (3-10), (27 — 2p,) times exactly аз before and letting t = 1 W° may 8! 
ditas à тет ЕЖ ри-ат-з+18, (3:11) 
тер, Бы -á 2 — 
" "ag = (й—1) % | 2 » gar-2n 4 2(n р 2, 2p,-1 гі 
bstitut; a i—2 Д 
ion е 
ОЁ (3-11) in (3-9) and simplification yields, 
c. -9 n-? 
0,2541 = „з [S ? E 1 m E ) emt 
à ғ m orm # ) mel, purine tet, (3:12) 
* -i i 
+ poe m 2 PE рз Biom. 46 
= 


„=1 


% 
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3.4 


where a, = 2" and аз = (2-4) 


(0— —1)(n—2)...(n—5s- 1). Е А А ; have 
M roten inn s spur the same manner as with (3-9) whose forms are identical, we finally h 


зі o a (2-8 (3-13) 
Co, 2r+1 ma а” 4 Xa] 
F $ — 
r- 5—2 
т r-p n 2r 2r— 2p, 2r— У, 2p. i 2r— E? .o (814) 
where ap= 21 У > ... р ШІ; ЗІ h=1 (s+1) 
$lm-1 pl \4P1— Ра 2p,.,4—1 


В 4 s оҒ8 
It will be evident from (3-14) that the series (3:13) must terminate at s = r+ 1 since, for n келі 
greater than this, the last binomial coefficient occurring in (3:14) will vanish, even for the п 
value of the р», namely unity. 


" А i tarting 
The evaluation of a,, in (3-14) is not as difficult as it appears to be. The successive summations, 8 
with respect to p,_;, ultimately give 


2 = zs = +15) 
wae e s oe) Qn pem Б 
те del). (28) шо m т-і1 
M n n т shall 
The procedure being the same for effecting the other sum, У, (2j + 1)7+ (, A eee d 
j-0 4-21 VI 
simply state the final result: 
: n n rtd n—s|[ n—s ) (3:16) 
97 + ILAR = fno 4 Б j 
дөнен ы Ee C2) 65 
1 8 28 25 (3:17) 
where £ == ——=== = ex 98:9 1)", 
ia (8+1)... 8) "(8 Mar шш. 


i В gums 
It may be pointed out here that there appears to be more than one method of evaluating these 


h only 
and that each method provides series similar to (3- 13) and (3-16). However, these different series are 


variations of the ones given here with the coefficients a,, and а, modified. It also seems poss! 


5 $ Б ) on] 
obtain a few relations concerning these coefficients but they are of only algebraic interest. The 
relation which I shall make use of at a later stage is 


(3-18) 


а, тұр = акы = 22?! 
On substituting (3-13) and (3-16) in (3-4), the mean difference becomes 


n—1 (r+1 n—s\/ n—s т-1 (r--1 -s on- |, 

А = | У Б а’ n'o( Е ү, ) 2i4192n-2i—1 в) n—s\ (n | zig 
PETALE à oen] v EE aes еа]? (29 
It can be verified that on putting г = 0 in (3-19), the value of A, given by (1-3) is readily obtained 


; : rm pion 
(3-19) can be modified to give an alternative form for Азы in terms of the hypergeometric une’ 
oF (a, f 5 y 5), where e 


Fle, f; ү; =) 14254240 DAE 1) a 
7 Yy*1 2 1 
+ и 
Taking account of the fact that the limits for i in (3-19) are to be taken such that попе of the bino? 
coefficients like Гг vanish, we find 


741 т-8 (из -I 
Aga = 2 > boe У ( \( n—s 
42 


х " 2i—5H192n—-21—s—1 
i=ss-1\ 7 2-8- | P 4 


т 3-20) 
+a, npg? = (" = ? (a = ‘) pen å ( 


i=s\ 0 
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Iti Я 
is then not difficult to show that the sums іп the braces on the right of (3-20) are respectively equal to 
n—s 
( E :) (pq? ,F,( п+ 228—1, 3(25— 1); 28— 1; 4р0) 
and n—s 
( » ) (pq)! F,( —n- 28, 028+ 1); 28+ 1; 4p9), 


fr 
om the fact that these sums are the coefficients of 1/р-1 and 1/t* in {(9+20) (q--p/0)" ^ i.e. in 


1-1) |" А 
(ы ah , віпсед+р = 1. 


"Thus, 
А E. n—s 
2041 = 2 22 ( |) р—д—%ьЁ\( —п+28— 1,3(28—1); 9-1; 429) 
8-1 s- № 
nm (7) prg, P —п+ 2s, (2+1); 251; 4pg]. (921) 
writing down explicitly the higher odd mean differences 


Тһе follow; 

following tables which will be useful in 

о 1 s 
m either (3-19) or (3-21) show the numerica fficients а’, and ay, for r= 0008: 


S@Sr41, 


] values of the coe 


Table 1. Values of ағ, 


8 
Ж 1 2 
0 1 
1 1 
2 4 р : 
1 40 32 : 
3 ^ 384 . 
4 1 364 1,120 ЕЙ Bie 6,144 . 
Е 1 3,280 30,912 1,488 1,013,760 | 122,880 
1 29,524 799,040 1,951, 
Table 2. Values of аы]? 
5 6 
а] 
— t ANS 
9 1 
Е 1 1 : 
3 1 ; з д 24 : ү; 
4 i ni 294 3 т 1,920 na E 
5 16 d % 
1 341 2,8 
TRIBUTION 
; Poisson 218 
The о 4. THE opp ORDER MEAN DIFFERENCES MORE iven by 
d order | ж distribution is gr 
mean difference Ap, for the Pols МА 
хил ] 
_ goad Fd “. ч Б 
Bon 2.2) par $ (#1) 
o i 4 rH 
= 27% У 2 6-37 as 
oad і-01- 
1: 152 


fo, 
* th, 
e за) 
"19 reason as for the binomial distributio: 
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The sums in (4-1) may be obtained in a straightforward manner; but this becomes quite p 
r > 2. I have therefore considered the alternative simpler approach which is to obtain the limit of (3-2 
for the binomial distribution as n > co, p > 0 and np > A. 


Since lim ,F,(—n-ra, f; y; 4pq) = ,F,(f:y; — 4A), (3-21) under these limiting conditions becomes 


n9, p-0 
np>À 


т+1 ч А-1 Да ; | ; colit (4-2) 
Жарау ЕЗ 2х b nonae 1); 2—1; 40) аса (2(28+ 1); 28+ 1; ) 


Also, by virtue of the relation 
Lu. 2 4:3) 
T1) iFi(&(2v +1); 2v 1; — 4A) = ет2А (2А), ( 


where the function Г, has been explained earlier, (4-2) becomes 
TEL А (44) 
Аъ = 26777 Y) Аа, І, (2А) +а,,І,(2А)). 
8=1 


It will be seen that the value of A, given by (1:4) agrees with the above, on letting r = 0; and 


Ag = 2A e7?X((1 + 4A) Io +4AL,}, (45) 
Ag = 2A e-?M(1 + 16A + 3222) I) + 8A(1 + 42) J}, 


after successive reduction of 1 „(и > 2) to I, and Г, by the recurrence relation 


4-6) 
1,,,(2A) = 1, (22) — (v/A) 1,(22). ( 
In (4-5), Г, should be taken to mean 1 
evaluating them directly from (4:1). 

It is of some interest to consider the standardized ratio A,,,,/(variance)"*4, namely, 


(2A). These values have also been checked independently by 


Qe-2A т+1 л) 
Deen = Sar Ў, Mea, (2) a, L2). 4 
г 


А limiting form for (4-7) can be got for lar; 


ified 
ge A by considering the asymptotic series for the modifie 
Bessel function I,(2A). This is given by 


Іда) ~ 


ет _ 47—12 (4 12) (402 — 32) "E 4:8) 
l ires + 218)? --}+о(®). i 


A А t 
This serves all practical purposes so long as v is not of order ух (Lehmer, 1943). Considering only the us 
term in (4-8), the limiting value of D,,,, for large А can be written as 


Qe-2A r1 ma 
асока, 
х+ А да), 


Бат 


which оп simplification and omission of terms of order А-1, A-2, ete, gives us 


1 И 
Dors S Jarn +p 43). (4 9) 
Substituting for the numerator from (3-18), 
Dy m 22-11, (410) 
Am 
бабы % 
When r — 0, the limiting value of D, equals 2/ т, as demonstrated in one of the tables in DY recon 
article (1958). 


5. CONCLUSION 
I may mention that although explicit results have been obtained here, it seems possible to 
certain interesting recurrence relations connecting the mean differences of different orders by conside ре 
the absolute moments of the distributions of the variate t 


differences. Thi i i ursue 
moment and I think I shall soon be able to record the outcome of this ibis being p 
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Finally i 
y,.Iwisht 
invi tice о express 71 
vestigations were iu вва Fe қара” to Prof. M. G. Kendall, under wh 
, for his continued support and 2 DORSET 
encouragement. PRONS. 
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Determi 
ination of parameters in the Johnson system of probability distributions 


Bx D. C. M. LESLIE 


Royal Aircraft Establishment, Farnborough 


aduation of statistical distribution has been 


Thes 
ystem 
of curves proposed by Johnson (1949) for the gr: 
back is that it із not very easy to determine the four 
his note, a method is suggested for 


Very в 
чес : 2 
essful in practice. The principal draw 
the observed moments. Int 
is not far removed from normality. For 


bution under study i 
ader is referred to Draper (1952). 
for fitting to empirical data, but it can also be used to 


"s researches, the generating function of a distribution 
tegral could not be evaluated. The first four moments 
ested the use of asystem of frequency curves to fit the distribu- 
he distri ver very far from normality, and that it 
he S, regi Въ plane. Therefore, this note deals 
distributions t too non-normal. 


Could 
б e fi 
ound very easily, and this sugg 


tion 
approx; 

Bave come 2 It was found that t 

nly with құрауға Paine which always lay in 6 

S variabli itting of curves of the Sy family to 

e æ has an S, distribution if Е 

2—5 
2-7 + ésinh™ (5% , 


(1) 


n tho first four moments 
heoretically). From the 
2. Formulae are given 


and А give 
arise t 
£and 


em is to det ormine 1}, Ò t | 
ding distributions which 
to calculate the д, given 7)» 6 


Бе; 
ing 
4 & uni 
Opp, an ин variable.* The probl 
o ed distribution (this term inclu 


by + tte poi 
y Point of vi ba 
nson me iew, it is comparatively easy 
‚р. 163), and from these formulae it follows that 
{o(o+ 2)sinh 3043 sinh О)? @ 
ang В, = 0-1) — a(o cosh 22+ 1)" 
2 -1 
> se дв ы us pter tr sosh 404 dulo + 9) 08h 2030 HN" 
ни ue 2(w cosh 20-1) 
(4) 
S: u and Q7 1/6. 
tionay PP о = е08 an 
н moe i and уз; the func- 
i, ftm Pid now reduced to that of determining 9 and Q from apa rm el em Ys 
f х0 Over, it quations (2) and (3) makes it clear that the problem is ee ern dus. dm 
к Y dud be solved completely if the distribution is ymer 
i 2 $ 0. This implies Q = 0, so that equation 3) reduces 
А 2 
| у dor Doe н 
2-27 1}. | 
for the standardized 


w = {/(4+2у)— = 


* 
fo, Th 
"egy © lette 
? я 
umulan, has been used in place of Johnson's 7 b 


ecause of the 
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= = £ and A are 
termines 6 direc: w] ini meters § ar 
de ile Q = 0 implies that 1] = 0 also. The remaining parar 
This i irectly, while ) 
ined from the first two moments of x. Johnson's fo: 
determine 


lized 
lae are for the moments of the norma 
rmula 
2—6 
variable E. Е 1 
! e Avtaian¢ ) 
d transforming them, we find р = £—Awtsinh Q di 
an йз = 3À*(0 —1) (о cosh 20 4 1). — 
ex susc = 
ibution i trie, 4-0, while it has been proved that Q — 0; thu 5 
Since the distribution is symmetric, Ші р 4 
Ола 
= 6—1’ 
€) being given by (5). 


rover, 
А ions. Howev d 
i f recognized functions is suggest 

lem does not Seem to be soluble in terms о à = 1. Thiss ; 
i mi nt SES eis it, = Уз = 0 (i.e. if the distribution is normal) then 3 1. the existence ; 
equations (2) and ( 257% 0—1 in powers of f, and y, may be Онега, jlify the works 
кереді: о ical nature of the expressions (2) and (3). То sim] (9) 
т = o sinh? Q 

Which reduces (2) and (3) to the Purely algebraic forms 


4-і 1) Pto 2) m 4 3( 4- 1): 
17 (о0- —_ M M 


(10) 
2(2т-+ w41)? | (1) 
= (0-1) Plo) 
w= 2(2m o 1j 
Where 
24 9o + 3): 
Ро) = МООС RN 3) m+ (65 + 30 + бол 1092 4- 
Expanding equations (10) and (11) about, the point w = l, we find (19) 
9m 2 
В agte. ay ; "m 
4(4т 4] 
Ys Se D O(o— 1), 
(12) and (13) may һө inverted to give (19 
os 1-4 у орф око), 15) 
( D) 
4f 
т = f c 
9y,— тей, + OA, Ys). air of 
" t : қ is р 
Equation (14) in Particular is attractive in its simplici 1 го small, this kno" 
В Plicity, апа vided р, and are 5 kn 
equations enables 7 and ô to be determineg very rapidly, ть Manlius ene ^ (OR A and y, must be 
airly accurately if 800d values are to b btained for 7 апа 6. 
14nd y, are not small €nough for (14) and (15) : 
times be Obtained by Using more term 
been exten 


" acy can 
9 Bive good results, adequate accur 
Xpansions 


ther 
5 е Process outlined above has, 

; and it is then found th 

usu te ары 


gore” 
fore 


at (16) 
52% Ma уно, и) 
4 20i ( 
шы ru DET NAM 
Tr His $(9y,— 165. i) а fof 
Tf these expansions do Not give ade, д а alues ^ e 
a quate aeeur, ; thi В ting V 
a process of Solving equations (19) and ШЇ) b У Aie аы" at least Provide useful star: 
labour would be needed to 9m to the thirg ord Rod 
etermined from equations (6) and (T), whi 


oun” ө 
ations; a considerable IA ms қ 
T. Once 7 ang Shave been found, Бат 
which should be Used in the forms 


(18) 
(19) 


Miscellanea 231 


It should 
be me i 
ШЫЛА: ntioned that Draper (195 vi p m of inverting equations (2) and 
Шу: Е z 1 pe (1952) has solved the probl i i i 
fit empirical а n n extensive abacs of f, and y, as fincHone o [^ Sun rm hi i s e 
a s to these abacs. For w he finds a powe i i ; oon : 
power series not dissimilar to e і 
m quation (16); he gives 


( = 1—0: 22Ty,— 5 
1— 0:3122, + 0-227, 0-047/1 + 0-0692, Ya — 0-025: 2--0(03) 
y y: 1) 


While (16) is equivalent to 
w = 1 — 0-3332, +0-250у, — 0:0964} + 0153/1, — 0-063} + (f). 


The coeffi 
Oefticients 
least got tho "Und s first-order terms agree quite well, and those of the second-order terms hi 
^ signs. There is no very obvious correspondence between equation (17) and the dt © 
ious 


or с 
Т желе by Draper for 0. 
Selected, ы suggested а simple but instructive test of these formulae. A pair of values of w and Ой 
п en bo rie eren, in values of f, and y; calculated from equations (2) and (3); о and Q Em 
ones, Үйіне obi ising the formulae given above, and the estimated values compared itd the exact 
hile those hie же from equations (14) and (15) will be denoted by (о, 0,) (first-order В 
ained from equations (16) and (17) will be denoted by (6, О,) (second-order estimates). 


he tab] 
el ч 
below shows the sort of results which are found when (0-1) is small 


@ = nj В, = 0-0103\ c -LHl| = 1:0983 
Q=01f у= ШЫ Q= uo Q = ла, 

о = а В. =0-2182\ = 112] == 1:0979 

О-065) у= с] A= Ті! Q= TP 

о = m В, = 0:5758] ш=115] a= 1-0969 

О-10) y= r3 О,- md Q= v 

и = 1:3 , = 0:0400 о = БО Фу = 1:2401 

О = ба Ya = pet О, = 0089 О, = 0.1037 

ood results. Equation (5) shows 


o give g 
eater than 2, and this implies that the 


t also be some limit on Ё,. Equation (5) 
at an expansion of the form 


long as y, į 
se M. sorio Ж Dp OU large, the second-order formulae seem t 
4 um aü Mile priis in powers of Уз must diverge if y, is gr 
> however, и "ed above will only work well if yẹ < 1. There mus 
> Valid for all values of y, as long as fı = 0. This suggests th 


9 = ууз) +) + ЛЫҒЫ e 


Might b 
e 
Very useful. Tt is already known that 
Аз = (a 2y)— p 


anq 
tho n 
ext ter 
term should not be too difficult to calculate. 
he problem and for giving 


The 
Much author is i | | | | 
е. ѕ indebted to Dr G. M. Jenkins for drawing his attention to 
aboy, pful advice on how to solve it регі to Dr N. b. Johnson for suggesting the test of accuracy given 
= H M. Stationery Office, for permission to publish this 


Note ' е по 
9. Wledgment is made to the Controller, 
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Note on а problem of estimation 
By D. E. LLOYD 
Ordnance Board. Ministry 9f Supply 

т opuCTION 
pa able А from ап unknown distribution with 
a function of r donoted by P(r). 


а random vari Ares 
h ДӨЙ харе which is 
"sts of а Seb of observed values 71, Tg. +++ m: 


dis 
е estimation problem has aris | 
p ty of b 

4 x poriment consist 


fr 
que: is 
Thug "ud function p(r) has a probabili 
e information available from an © 
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= ) а T buti а specit 
er with the number (n — m) of unobserved values. Truncated distributions are ә 

x 

togeth 


al ease —€— 
i pi 
i tion for 
i tinuous func 
ti However, in this note a case with a con 
et in practice. 

is frequently m 
considered. 


ution 
imate рата” 
ed to estima 
d known. However, a similar method could be us 
i F'(r) being assume: Б 
function p(r), 


n 
: se based 0 

j 1 with those 

meters of F(r). Estimates based оп all the available information are compared 

A 3 

s ie sie om with those based on the ratio min. 
Ty Tos Tm 


d. ле distrib 
e examples co! ere i meter of t 

th idi the information is used to estimate a para: 

l nsH > 

2. In р. 


the 
ndom, 

T T T). БАҒ і value of is chosen а 

3 e chance of an observed value is p( ) F(r) us, i а single alue o 15 

. The с : Е Thi tra 

chance that it will be observable is 


(1) 
ps [ро F(r)dr 
and the chance that it is not observable is (1-Р), 


fr. 
nge 9 
the integral being taken over the whole r& 

e in 

4. 1 а sample of size n is taken from the population, 


being 
values 

the probability of the observed valu 

Ty To, +++) 7m and there being (n —m) unobserved values is 


m - 
"C. TI {p(r;) F(r,) dry (1--Рул-т, 
ізі 
The log likelihood function is 


(2) 
m 
L = og", + 5 log (м) + У, log F(r,) + (n—m) log (1 — P). 
ізі ізі 
1f 0 is the parameter (or one of the parameters) of p(r) to be estimated " 
12 m әд п-т ЭР zx 
967^ à ap lon) — 1-Б% 
is the maximum likelihood equation for 0, 


"C, Pr(1— Pyn—m, 
The log likelihood function L = 


is 
iion! 
3 ; сала 
mlog P+ (n— m) log (1 — P)so that the maximum likelihood 
oL 


0 P æ i1—5z5-20, 
therefore 


26) =” 
gives the required estimate, " 


(9 


Езтімлти BASED SOLELY ON т, 
6. Given that m realizations 


Tos 


“Жа 
of R are recorded, the 


ill г 
Probability that the values Pis бау» УГ 

m and (я — т) unre 

Pr (m recorded 


corded values} 
and (n— m) С Values] 


m) unrecorded values 


osult » 
tef, 


m 
i {p(r,) P(r) dry (руль 


therefore 


Thus 


(9 
Дал, Ed mop 

20 У б оа Р 0 

gives the required estimate, 
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% Mons aud SPECIFIC FUNCTION FOR pn). 
а ya 0 
= _® operar < 
Thi pe) = yug en 4475 = 
18 ге i э 1: n " : 
ee а y? distribution with v degrees of freedom and unknown dispersion. 
, log p(r) = 3rloga 4 (5v— 1)logr—ar—log 7), 
hence 9 logp(r) = СЕС. 
Substituting i id ж” 
Stituting in equation (3), we find 
m» т n—m oP 
ту _ E aP _ 6 
Da A" 1-Р da m 
Put m 
n-m 
_1$,, B=: "m 
Then e | тізі 2 
equation (6) becomes 
moss m 
7а a 1-Р ea 
8.4 = i 
Biene B are jointly sufficient statistics for the estimation of a. 
Ons (4) and (5) give 
m (9) 
а) =— 
Р(о) = = 
а 
nd NX (10) 
for the oth dc Е 
er estimators of 2. 
8. ті Бексгїо FUNCTION OF "m itions of the 
°xperime ape F'(r) must be chosen, as а result of prior information, to describe the conditions 0 
- When truncation is assumed а function is chosen 1n the form 
кә-і C <”), 
=0 (>т 
— Р nable to 
; А В , it seems more reaso 
Use a er problems may arise in which, as & result of previous experiments i 
nuous function for F(r). We consider the сазе where 
Where д po) өріс” 
isa i ы 
9, For th constant, the value of which, it is assumed, is know” 
© particular functions chosen 
со 
р- Í P(r) pir) ar 
0 
P [женк 
= г) Јо 
(3v) " (11) 
1\5 
қ = ( +1) j 
Е , (12) 
1 ФР = 1 v 1 
1-Р ёх оал+ (е) 
anq (13) 
1 ar р 
Mat; Ба = 200+) i 
atio sate ions (8) and (10), respecti ur 
te for P in equat! 


ns (12 
) and (13) ean be used to substitu 
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THE CASE у= 2 


10. This is the case where r is the squared radial distance of а point distributed in two dimensions in 
circular normal form. 


Equations (11), (12) and (13) give 


-і 
Р= (1+) з (14) 
1 ӘР ү т\з (15) 
ра 12) 
and Ba 1 налуы, (18) 


Using equation (15) to substitute for P in equation (8) gives 


A 1 
ape О: 
Ао 6 +в) i 0. 


(17) 
The positive root of this equation gives the maximum likelihood estimate of a, denoted by &y; . 
11. Using equation (14) to substitute for P in equation (9) gives 
1 n 
Ld ш m 
therefore a d (18) 
AB 
We denote this estimate of о; by&, 
12. Using equation (16) to Substitute into equation (10) gives 
1,1 
ARE Eal -1- 
А a UL +@А)-ї = 0. 
А-А 
Hence == 2, 19) 
a= ( 
We denote this estimate of x by 8, 
VARIANCES AND EFFICIENCIES OF ESTIMATES 
13. For large n, the asymptotic variances of the three estimates are found to be 
^ a? 
var (Zu) = —(P— P24 pi (20) 
2 
var (ĉn) = (Р pai (21) 
^ о? 
var (24) = — р-з, (22) 
п 
If we measure ‘efficie: 


ney’ in terms of inverse variance and regard i % efficient, Fig. 1 
shows how the large sample efficiencies of 8, and 2, vary with P. & кунын Д кы ы Р is large, 
аҙ when P is small, as might be expected. Тһе E r 


sum of the two efficiencies is always 100 %. 


Y 


2 


[2А] 
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servable values 


OL 
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P, expected proportion of ob 


Fig. l. Comparison of officiencies. 


On the non-central chi-square distribution 


By MUNUSWAMY SANKARAN 
Presidency College, Madras 


1, А 
bdel.A 
Ау (1 
(1954) has shown how the transformation 
(1) 


is the number of 
ibuti on-central ah where, fist > 
фонын ың ositive constant. To determine h, 
ies inz-*. He finds 


At ^ 
i ;Treedom 
ie the et the non-central parameter and 
ely) sos Pansion that h ans of y expressed in terms of cumulan о minimizi 
reducin ù = } is the appropriate value to choose for d 
: : ede level 
ы Бох atio the higher cumulants, к.(у) for > 2» to a minimum ›. He then 
us 3 
ее. assum, i 
e SAU A). that (y’2/r)t is normally distributed about 1-21 +)" и 
3 Tnatively i i i 
7) e panas s he suggests a ‘closer approximation » which consists In applying the Cornis 
to the standardized deviate, 
(2) 


lops two methods 


th variance $(1-4-5)/r, 


ih-Fisher 


Where y—K (Y) 
P= Деку 


is ара; 
z^ Даль Soin taken as 4 
ame] 2c Present ра; : rmati В 
per we shall use Абу? transfo ation (1) ae 


Cur, Y» № = 

ulant оғ $ here, ур, 

° jua ne eee values 50 determ і 
еды dure adopted bY Wilson 


+ instead of setting а fixed value to h, 
d that the leading term in the third 
& Hilferty ( 1931). It follows 


(3) 


that f 
Y vani 
nishes, thus following the proce 
3A) 
АЁ 2(f st 
wi LU _ y= , 
here hal-geal-3 TED 
22:5(5- 1159 (4) 


в, = 
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lants 
у st four cumul 

2. It will be seen from (3) that } < ћ < 1. We have for the firs 

i th cumulant of y 7. It wil 

om the value of h given by (3), 


k 
k —2)(1—3n) № 
к) = LEM Dg EA. 00-2) 1-389 с — 
j PHRES BREINE, of ), 
+1) (0—2) (а 3) 1818 4 


ki 
k. 2 
и 


à 2 1 
Т7 пе шма на аы ) 
TAA 


=. = (5) 
p 
а 27, 1 
[2(23 — 49h + 2342) k? — 3k? k,] | 
куу) = (1—1) 3s О 


Kv) = phts- io 


1 
n 
1 
Kly) = o( s... 


3. Two ap; 
(іш) First 


ANDE В В Р 

oximations will now be considered. А 1 | T 
о We take y as normally distributed with the following mean and 
deviation: 


% зру 2 

Gy) =1 TA(h— Dis —h(h —1)(2—n) (1—3A) Sii" (6) 

hit (1—1) (1-31) | 
=—11- fes. 
ет fı — dH ОЁ 
aper 

Tt should be noted that this approximation is similar to that given by Haldane (1937, р. 398) in his p 
‘The approximate normalization of a class of frequency distributions’ 
mation Н. 


ion.* We shall i 
expansion to the Y ofe 


in this case арр 
quation (2), where h is as before gi 
from (5), using the k, of (4). Tt follows that considering te 
of the standardized а, 


eviate Y is given by 


rth 
ly the Cornish-Fisher form of the Edgewo 
ven by (3) and the cumulants of 
TMS up to order r-t}, 


2(23 — 49h + 232) да 97.27, — 4(4 — 5h) kak] 
Y. EA po ay E PPH ма ag hls AA- ялын 


y аге dei + 
the 100% percentage рої 


(ii) with h = 3. Further, the sim 
Edgeworth approximati 
above (but not given in the table 
but Aty’s is so 


% point both Eq 
mewhat the more accurate. The H ap 
the table, 
5. Ibis difficult 


Proximation is n 
not available for с 
percentage point 
Some conditions 


Similar approximate 
i), is here Somewhat better than P (t 

‘han his Simple norma] approximation defined Ee , 
Seworth approximations are g 


in 
ot so good here, and is not shown 
to draw any general conclusions 


T gre 
à DS because exact values for other percentage points 
omparison, and it is by no means certain that an 

will also be go at another. The p 


e 
арргохипа {оп which is best at dos 
80 be resent, Investigation has, however, shown that un 
approximations based on a fixed value ofh = $ will nog be the best to use. 
* I owe this Suggestion to Dr Е. М. David. 
T Strictly this includes algo the term 
well as terms of : 


TQ is aS 
ad T 65-3). It is felt that omission x e о 
; ОА NL] not affect the conclusions Teached here. The inclusio! 
nly stre; hen t] i = 4 tions 
Біле Y strengthen the con onsiderably increase the computa 
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Table 1. С 
+ Compari: " i ? distributi i 
parison of exact percentage points of the x^? distribution with the values obtained 
from various approximations 


Upper 59$ points Lower 5% points 
| | | 

| Edgeworth Edgeworth, 

f 4 " Aty's | | expansion Aty's | expansion 
exact ‘closer | Approx. | to the Exact ‘closer to the 
| approx.” H present approx.’ | present 
| | арргох. арргох. 

— iac ! 

» 1 8:642 8:38 8:58 8:656 0-168 0-171 0-157 
4 14-641 14-62 14-63 14-661 0:646 0-643 0-654 
16 | 33-054 | 33-08 | 33:07 | 33-060 6-322 6:320 6-331 
25 45-808 | 4533 | 4531 | 45317 | 12080 | 12.077 | 12-085 
$ 1 | 11707 | 1167 | 1169 | 11681 | 0909 | 0908 | 0-918 
4 17-309 17.27 17.28 | 17:318 1:765 1-766 1:763 
16 | 35427 | 3544 | 35-43 | 35435 7-884 7-885 7-893 
25 47-613 47-62 47-62 47-621 13-733 13-732 13-737 
7 1 | 16004 | 1599 | 1598 | 16-013 2494 | 2494 | 2497 
а | mas | зый | apes | мае | Stee | "NEED | Sn 
16 38-970 38-96 38-96 38-978 10-257 10-256 10-264 
25 | snosi | 51.06 | 5146 | 51968 | 16226 | 10227 16-231 


! 
m of an earlier version of the paper. Ialso 


My thar ке 
nks are due to Dr F да for her helpful criticis 
sche spec ener Е Рет in the preparation of the paper for publica- 

tational work in 


had th 
A e benefit of } ; D 
ti қ of helpful suggestions of Prof. E. 
keen Е inally, I pe e: my appreciation to some of my students for compu 
exion with the table. 
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s of a matrix in multivariate analysis 


On the distribution of the largest of six root 
LAI* AND CELIA G. В. 


he Philippines 


ANTEGUIf 


Bv K. C. SREEDHARAN PIL 
The Statistical Center, University of t 


1. INTRODUCTION | тə n 

Во . mepotheses and confidence interval estimation in 
Y (1945, 1953, 1957) has shown that tests of certain ДУР ith the distribution of the extreme charac- 
issociate distribution of the extreme 


Multivan; 
а ariate situations сап be based on those 45 v P зач, The 
1С roots ofa matrix whose elements are functions 0: two other supplementary parameters, m and m, 


Loot, қ 
W] іо Черепа on the number of non-null roots, 8, Ми ena P is Г mbar of. samples and the size i 
are functions of the number of variates апе, ins 


© samples, ей u 
Statistics and Visiting Professor of Statistics. 


h Unit Vati i iser i Mathematical pan 
ed Nations Senior Adviser m Mat у Lecturer in Statistics. 


T Research Fellow, 1957-58, and pre 
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-null charac- 
ivariate tests and the non-nu Et 
А theses are often posed for multivaria gee 
Three different DES ч ШЫ arise in each test have a distribution, the form of re res Duo 
teristic roots ү jm all the three tests. Roy (1939), Hsu (1939) pos pue 
i i i rm 
E е ilie distribution of the roots, Ó,, ...,0,, which can be given in the fo 
pen 


a) 
105.0) = Cmn) TE A-O TL 0-6). (0-< 0, <u. <0, <I) 
as ici i»j 


i+2 
te i {=ч ) 
ізі е 

ies = TER eT) Ты 
Пр) r(ntit1 riz 
ізі 2 2 2 — 

and the parameters m and n are defined differently for the different types of hypotheses (Pillai, 

In this paper, the cumulativi 


Я ith 
à died wit 
distribution function of the largest of six roots has been stu 
percentage points, 


(2) 


a view to tabulating the upper 


Pillai (1954) gave the ex 
Taking the cases — вап 


ОЁ six roots, useful for comp 


n+4(] snail 
T(2m+5)T(Qn 46)” (1-2) 


[3(m 4- n+ 5) (m+ 2) + (n 4- 1)] ano а)" 
+4) Г(2» +6) 


Б 


t 


27+] — уан 


(3) 
) + 6) атн х)", 


е derivation оғ (3) the reader ін referred to Bantegui (1958). 


largest root for $—2ands-5 Th 


the 

А т 

» Бауе the upper 5 and 1 % points m ап 
= 0(14and varying from 5 to 100 

gures to ensure sufficien: 


y the 
that of Pillai and using equation (3), obtained т 
95, and 2. For Computing the values of Pr iN for 
e 9 Percentage Oints of the largest ГО ue 
en examined and backward Tino intions чы тап n; eem value and a few = 
probability aboy, o iin Carried out using two values, one £ 
vel. 


, nce 
$ e ts f the distrib Hib a orcas be the differ " 
Probabilities. Pillai (195 »Sen (1957) ana een (1957) madesue 
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Table 1. Upper 5% points of the largest root for s = 6 


m 
n 
0 1 2 3 а | 

5 0-8246 0:8499 0-8685 0-8830 0-8945 

10 -6552 -6917 1206 1442 7639 

15 “5371 -5758 -6077 -6346 6577 

20 -4535 4912 -5231 -5505 5746 

25 -3918 -4276 -4583 +4852 5091 

30 0-3447 0-3782 0-4074 0-4332 0-4564 

40 :9775 :3069 +3329 +3563 +3776 

60 +1995 +2225 +2433 -2624 9801 

80 :1556 :1745 +1916 +2075 +9994 

100 1275 1434 :1580 1716 -1843 
130 0-10036 0:11319 0:12504 0:13615 0-14666 
160 -08272 09348 -10388 -11284 12175 
200 :06702 -07586 -08409 -09186 -09926 
300 :04545 :05156 :05728 :06981 106790 
500 :02765 :03143 03498 -03835 04160 
L 5 01772 01946 02113 

1000 01397 -01590 


points of the largest root for s = 6 


Table 2. Upper 196 


Гүр 
m 
n ————— Я 3 4 
0 1 к 
г 0-9169 0-9255 
араб 0:9065 ) 
: p p 7724 “a 7088 
10 7113 Roe 6619 ШЕ = 
15 -5986 -5462 :5757 00 E .5559 
a ium -4790 5081 :533 
` à 0-5011 
0-4261 0:4542 E 4177 
49 m 3484 Heu -2948 3125 
Ж +3194 ОБАВ +2757 2342 +2493 
| -2315 ; -2181 қ -2072 
19 1814 ks 1803 138 
0 1491 З 
0:15457 0-16536 
0-14314 119824 -13754 
130 0-11762 E cm 11901 en 11232 
160 09713 pon -09659 ДИЙ) 07701 
200 07880 pe :06594 04388 -04727 
m :05355 pue Roos 102920 -02405 
3 Е 0204 
1000 br :01855 oan 
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f approximation 
2,3 and 4, respectively. A comparative study of the values of the error of app 

dies for s — 2, nd 4, 4 

jue пева obtained indicates that; 


1) There is gre Wi i «act cases in the 
i ater agreement between the probabilities for the approximate and ехас 
а) F К 5%. ilities in tt bper 5% points occur 
upper 1% points than in the 5% | = tiesi " per зы itso 
in п which опг i ives a difference of only one in the fourth decimal place 
i i lace, which on rounding gi 
the fifth decimal place, di f 


i ion i э slowly as the 
3) Taking any value of the parameter m constant, the error of approximation increases s 
is . 
ғы parameter is increased; such increase occurs onl: 
the fifth decimal place when rounding. 


int for the 
The error of approximation was similarly studied for s = 6. From Table 1, the percentage po 
parameter m = 0, n = 500 is 0:02765. At this 


roximate 
percentage point, the numerical value of the m va. The 
D.F. is 0-9500515, while the probability based on the exact formula. is calculated 5376 0- hs vost dis 
pure approximation is then equal to 0-0000410 which occurs in the fifth decimal p ое каче чү Т. 
consistent with that for lower valuesofs. сап therefore be concluded that the ш pase c 
is sufficiently accurate for the computation of the upper 5%, or lower percentage points o Б 
largest of six roots. 


, ror in the 
Tt is to be noted that the error of approximation discussed above is not the same as the err 


е : E = 9 based 
acomparison of the percentage points obtained, for instance, for s 7 
illai, 1957) with Foster’ 


i ity in 
y in the sixth decimal place or at most is unity 


is point for s = 2. Since the er: 
with that computed for smaller values of 8, 
evident. 


parameter ranging up to 1000. 
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The rank analogue of product-moment partial correlation and regression, 
with application to manifold, ordered contingency tables} 


By ROBERT H. SOMERS 
Bureau of Applied Social Research, Columbia University 


[rr (1944) has demonstrated that the ordinary product-moment correlation, Spearman's rho, and 
em all's tau, are all special cases of a generalized correlation coefficient. This paper will demonstrate 
K а general ized partial correlation coefficient can be defined, as a simple extension of Daniels’s theory. 
a endall 5 partial tau will be seen to be a special case of this product-moment type partial coefficient. 
: hus, it is neither coincidental, nor remarkable (Kendall, 1955; Moran, 1950), that Kendall's partial tau 
15 the same function of total taus that the product-moment partial is of total 7. While the partial coeffi- 
Clent described here is of the first order, its form suggests a generalization to partial correlation of any 
order, which will be investigated at a later date. The theory will be seen to be immediately applicable to 
Partial association in manifold, ordered, contingency tables, inasmuch as the partial tau here described 
18 equally applicable to the case of ties among the rankings. As a further extension of product-moment 

Cory to order statistics, the regression equations implied by Kendall’s 7, and 7, will be described and 


illustrated ot rs 2 S aen dier 
, and the bility of partial regression indicated. 
E un ed са it is hoped that the generalized 


No remarks are made here about sampling distributions. However, ie genere 
Partial coefficient defined below will contribute to the difficult problem of the sample distribution of 


Partial tau, 
Daniels's generalized coefficient of correlation, 
Хау8; 
P= Атағы) 
Kendall, 1955, Ch. 2). When considering the correlation 


18 constructed i i anner (cf. also I ч Е 
Over ж ob jects anas ccm p» Q, the objects are first numbered in м, шы: em mn 
n? permuted pairs of objects are formed. Each pair is assigned a certain put E iei s ids Ты 
> and b,; for variable Q. The ordinary product moment 7 реа - S WA rho becomes a special 
Score, a... is the difference in variate values of the ith andjth objects: к dum score assigned to each 
Case when the difference in ranking, rather than difference in variate va ys score of {+ 1/— 1) for pairs 
ре, Finally, Kendall's tau is a special case б rine Ө ihi ib ийе and 0 when the ith 
келінін the jth member of the pair 2254 en centena have the same rank. 


| | ] : 
Ject is paired with itself, or when the ith ап o d Se 

| i i i d, contingency tables may 

рше и ы E^ correlation for such data. 


This last ] D 
5 St case gives Кепаа 7). В ant б 
Tankings of alise with many ties, Kendall's Tp к к бола м ии 
oF convenience, we shall change Daniele nome here q = n°. The manner of assigning scores 
Басаев iable R, for i = 1, За, xi . Thus Daniels's generalized 
is unchanged, ies edicion aniels' ауу become ts) for example. 
сое ее ; sim 
nt becomes Eui (fori = l9) 
Гро = "уш Evi) 
scores as components of vectors in 
al, orthogonal basis. Thus, if we 


vi and W; 
P, Qand R to represent vectors 


fixed, norm: 


П consider the и, 
bols 


в taken relative toa 


Y ee following discussion we sha 
R, we shall use the sym 


ave Siona] Euclidian space, alway: = 
havin Objects ranked on variables P, Q an 
Е 9 = n? components, 
Q = (0% m» 


К = (01,102, ща). 
P = (uus suey Ча) 

Tu | 
Ther. аз in ordinary vector analysis | m 
ІРІ = (ubt n 
eau of Applied Social Research, 


252 of ere ose analytical problem led 


T This arti : ; Publication No. A~ $ Menzel, whose à Й 
Columbia on кес; = bap уң оу colleague, Неге, The preparation of this paper was 
rsity. я d s ; 
У the pursuit of plied Social Research. s 


ci] d Whose ingenuity aided in, of Ap. 


faci: 
Ad by funds obtained from 


the Bureau 
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1 it vectors 
and we shall designate the unit vi З | Қ є. m | 
=— =>, a= 
ampe ^ [9r ÎR] 
Jar (‘inner’ or ‘dot’) product of two vectors is, as usual, the scalar 
The sca! 


(P.Q)= Uy, T Us, t+... + US. 


r 
tion of vector (0! 
i ion. We need to employ the no , А 
is poi diverge from the usual vector notation. у ы | едой аир 
| А% this m But ihis isordinarily defined only in three-dimensional space, and - ә aero еН 
e dimension. Тһе generalization of the vector product = such a e Meuon кере 27 
+ i 5 n ll avoid introducing n г ЕЯ 
tion (Kaplan, 1953, p.161), but we shal d о nil 

DA RR Aeon which can be treated as а vector in а companion space, j 

5 e; 
Hn (P x О) = (шо, — uv, u Vg изо, -+s Ugg Vq— ug Ue a) 


ie. a vector whose components are determinants of the 2 x 2 matrices of the form 


шош 


‚ alli <j, lexicographic ordering. 
v; vy 


This vector therefore has %4(4- 1) components, 


the vectors P and Q. (Note that (P x Q) — 


dby 
and is the vector perpendicular to the plane define 
Having established these definitions, 


-(QxP).) 


we see immediately that 


Тро = (0.03), трр = (01.0) and тов = (03.03). 


Y Gy). (64 хаз) 
We shall show that пав (аа хо). (а, хоз 


(1) 
A (а, x ®)| |(o x а4)| 
is precisely the partial tau defined b; 
1955, p. 118), when 


: uct 
e representation of the ordinary prone 
(cf. Kendall, 1948, p. 372) as the cosine of a different, but equal, ang 
з. In the ordinary representation 


Which is equivalent to the above form, 

and immediately su 

generalized to any 
In order to sho 


Tel 
the former has obvious advantages of representatiOP* 
to а partial of а 

number of vectors. 


hat 
› itis only necessary to observe t 
(25.04) (cp .0¢3) — (01.03) (24.04). 
will be demonstrated in 


1) = 1. Могеоуег, 
Should be, Тов 


ICETA] = Jte Ха). (4 ха,)] = 


(ад x o4). (o, x 93) = 
This is a well-known i 
true, then, 


is i8 
the general case below. If ык A 
(а.о) = Тов, (01.04) = са 
ТРТ рд. In the denominator, we hav 


Mo.) (25.04) — (6 .2,) (1 .a)] = V(1—Tpa)s 
and similarly for |o x ®›)|. We have thus reduced th iti Ч tial ien 
Furthermore, the form of the partial immediately сд 5а s ш сары: ads ; 
covariance. Tt might be noted that the partia] varii 
is the 2 x q matrix 


5 А аа— bc 
The phi coefficie %, 3 
T. f i fourfold z Vila-+b) (ae) (b +4) (с+а}’ it should be noted, is the product 
лоп of a fourfold tal le. 


a 
-moment cor? 
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We shall 
А show that thi. К 
Partial т as defi this equivalence betw i i 
f Mes een partial 7 as thi 
efined by Kendall, holds by enumerating the heri dip cie Базан дал 
. We have: 

& = (о), 
ба (of tas 1)» 


о (wj, Wy, «Gs 


le. unit 
vectors, whe: "ox < 
re, for example, и; = u;/[P|. Kendall’s partial т is 
" - Уруш; — Eu; vi ujwj 
ORE = тү (Supo (Хи 
y ivi) lC (Хи) 


We wi 
vill show t 
same а E numerator of this coefficient is equivalent to (2, x аз). (2 X 03) Since we used thi 
rule for vector products in establishing the denominator, this will оса for Miro 


? equivalence, 
Y definiti " 
ion (and omitting primes on uj, v; and wi) 


(о 
1 X Go). (а = 
2) + (6 хаз) = (u, vg — из) (5 Wa — ишу) +-+ (изба Ug) (ty Wa — Мао) 
Жн са " + (Ug Vg — gg) (gs — Ug Wg) +- + (иа -100— eai) (tga We Ug Wet) 
n be brok ; gus > 
oken up into some positive terms with шіруш;, and some negative terms of the form 
ese two groups of terms are reduced as 


"tap, 
о, v; M М А ] 
+ vjw;), the former for i + j, the latter for ? <3- Th 


Ollows: 
Xujvw,- Ури ууш Хин (+), all 17= ЖЖ 
i 
E 2, зи ель, + vw) = ppu Enn usn Prid (all i,j). 
ut, since these aro unit vectors, Eu = 1. Thus 


ала (ау ха). (e, хоз) = Eti МЕ игш(огшу +0700] (all 1,7), 

n 
we : Я 
ауу на need to show that the negative term is Уши 2 uw, Since the matrix of terms 
#140; + vjw;) is symmetric 


% 
М5 uow + оуш)] = D wares (Vie t ms 
i<j 
=) u; Euv w) + > шие, 
i<j 


i<j 


)+ Xuiviws (all 4,3) 


w) + Уши = Xuv EUW 


r formulae will serve for illustra- 

computed from Daniels’s 
rules give a numerator and 
thods. Similarly, Daniels’s 
dant information. Since 
those consisting only 


s of the vecto 
olved in T when 
the Daniels's scoring 
1sual computing me 
by adding redun 
f reduced vectors, 


And the equi 
" mi me is established. 
tion, бт rical example of the partial 7 
Bneralized ort licity we shall avoid t 
Nominat, coefficient. As Kendall (1955, p. 20) notes 
ae ation i of т twice the size of those given by the us! 
980 voor. creases the size of the vectors we have to consider, 
nt terms ia ud large in any case, we have computed by means © 
Ed uch that i<j. " d vectors by the su erscript,* we 
See th, 5 the example of Ken 5, p. 117), and denoting the reduced vectors by p рі, 
р.115 ° О and R each aer er а the scores assigned to the 18 pairs as кы ү Ы 
fashi 7. Obviousl [Р*| = [Q*| = Тегі = 15, and the a, are unit vectors constructed In Tm 
s d Then (o 8 » |-10 | A 45 уғаге vectors with gm(m — 1), or 105, compon T его 
digg; (n — 1), 3 та ‚ (01 хаз)* an ar ct ranked. Each component is & i E o ach 
tl Suishable pai ing the number of obj? 0. In the tied case there o be scores о! 
pair of pairs, the score bei 


computed by mean! 


he redundancy inv 


ng + 2/т or 


m. A 
n this example, we find 176 
las x adl = 5-405419) = 18 
216 
(аха = 4057405 In -/ Lu 
ang 4(6-15) 
(аха m xt = 8 ' hh 
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—36 


= = -0:185, 
giving тавр = 7216) (176)} 


in agreement with Kendall. 


i iz t these 
For those who work with manifold, ordered, contingency tables, it should be emphasized tha 
ы operations hold equally well for ть. Thus, when there are tied ranks 


Ten pg tps _ 
Мт) От) 


Toor.p = 


is also the product-moment analogue of a 


partial correlation for contingency tables.T 
As illustration of the meaning of Kendal 


П ions 
1Г tau, it may be useful to consider the regression equati 
e (Q.R) 
implied by the product-moment analogy. From our definition of Tor = = Токі , wem 
- Thus, regression coefficients may be defined ка i 
е of total regression, for variables P and Q with n 


Р. P. 
в = fy -PQ (Р.О) 


ЮР Вр’ 


(а.ш) it is clear that 
the variance of scores on Q may be defined as |013 


same way as in product-moment theory. In the cas 


Я on 
have six objects ranked without ties, 


, since they are of the fo: 
airs give the following 


. T Sucha partial co 
In a recent text; Sidney Sie, 1 
York, 1956, ы p 


ied case was used į г 
h In an illustrat; 
, Nonparametric Statistics for den 9 


pion? 


; 1яса’ 
xample, without justifie New 


ehavioral Scientist, p. 227. 
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Table 1 
| 
| Level of knowledge 
| Low Medium High Totals 
Voting i 
£ 1n accord: Zi Y 5 

CHE. bmi ance with Xen 46 55 39 140 
No 9 35 38 82 
Ex. Totals 55 90 77 292 


with many ties, there will be 49,284 permuted pairs 


Since thi 
this i r 

his is equivalent to ranking 222 objects, 
where Q represents knowledge and R, vote. 


enterin b 
in 
E into the scatterplot, in the following manner, 


5448 


by Kendall's 7». 


observations implied 


Fig. 2. Scatter diagram of pairs of 
(Agai ; т 
їп, the circled numbers are the number of permuted pairs at the given points.) 
Его 
m th .3237 _ 0:282 
e Scatterplot, ГА 3237 0-201, fi 11480 -0 


~ 16115 


Agai 
p" means are 0, and the regression equations become, 
Q = 0:201 В, В = 0:2820. 
оша expects is exactly th 
of ties among the rankings is 


e value of T; 


? val 
че of д) 3237 = 0-288. This as че w 
(6115) (11480)) : баб тікі 
Scat, Eency table given did Tt will be noted that only » uem case om tho redundancies in im 
о : і 
Sin Plot, > the regression coefficients to be unequal. : Fig. 2) is the reverse of the last, and 
+ will be noticed that the first тоз okipan of the rule used in assigning scores; 


e, "ly f& 

LES or 
the columns. Again, this redundancy enters in » | 
roduct-moment statistics. On the basis 


Furg e 
or p ther y f i 
ал; ork is j istics analogues P ; multiple correlation and 
"eg інісі “Ni аш progress on the order statis Қа тезі general partial ке ee forthecaseof tied 
S eory nnd жены ей, Similarly: the correlatio ^ 

rder statistics can be £ 5 


Contin 


Possible 
terplot ( 


because 
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i i hat does not 
i i i for a contingency table t т 
nks i to assymetric coefficients of association ; \ А à lum fA 
d 8 E wie An Mon i their maxima when the observations in any row or colu: 
have ordered у: ‚һе. ó 
ee да i : momen 
. їз s an alternative point of view to those who reject [ene pus 1954): 
B y Е ional meaning’ (Goodman & kal, 
aes i ds that they have no ‘operational me: g el nie 
derivatives as phi on the groun: Л 1 кы | ( dir 
i there are other times er 
; ch meaning for a coefficient is often desira e, t k bn aitari 
о ртеаїег ce eem Often both theoretical power and operational meaning t€ io) 
The resent investigations have resulted from a need for a coefficient of association in eer i eer А, 
ке өтерін tables which would permit logical inferences to be made about operationally 


3 adipem а а partial 
intervening variables. Thus а coefficient was needed which would give information about a p: 
coefficient when only the total coefficients we 


in this сазе 
re known. Obviously, if a choice had to be made in this саве, 
theoretical power could not be sacrificed for the sake of operational meaningfulness. 
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The busy Period in relation to the queueing process GI/M/1 
Bv В. уу, CONOLLY 
ABSTRACT, A t 


А і h 
ica] ж өппехіоп with the simple queueing process bi 
nmerical comparison is made in а few typical cases botwe 
land D/M/1, the latter being of 


ico pas 
Pendent input, while the servic 
‘Markovian’, type, viz. 


of then. 
Т onseeut; 
з18 given by 


completion of n с ive servi А Ssumption 
a тузсе i 
instant t. Periods in 


x for service is 
an intery, 


al (0,1), the ( 


" f the 

that the probability balt) 05, 
ЖО) ш ЖАҚСЫ 
Denoting the P.D.F. of inter-arriva] i N 


4 the 
+1) being in progress 8 


a2 
rvals by 
:3) 
^ a(t) (0 & t o) а 4 
with mean а, we shall indicate by е (4) th th 
: Ed n е P.D.r, s г Jeng 
E n & which time n consecutive Service periods are ез и ep mter-arrival — s the end 
of the interval, Thus. ™pleted, the (+ 1)th being in progress а 
Cult) = a(t) b, (t). 


a) 
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Mos € TAS — 
oe Алы following investigation will involve Laplace transforms. In general, we shall use the corre- 
Б Greek letter for the transform of a function denoted by a Roman letter. 


2. THE BUSY PERIOD 


Fr B : H 

E in the Server 8 viewpoint the progress of a queueing system ofthe type considered here is a succession 

be ernat Ing periods of idleness and full occupation. It is the latter which are called Busy Periods (often 
eviated to p.p's.), and the object of the investigation is to make probability statements about the 


lengths of sj. 


$ The length in time can conveniently be treated in the manner of Bailey (1954). Essentially this con- 


Sists į 5 E Pa A 5 В с 

а а study of the special process the starting point of which is the arrival and immediate reception of 
zice В 

de ісе of a customer, and which terminates when the server becomes idle once more. The crux of the 
Sent treatment is the simultaneous investigation of length, measured in time and number served, by 


>= an tegro-difference equation technique of the type used in Conolly (1958). This is a versatile method, 
the author has in essence used it also to study the B.P. in connexion with the system GI/|E,/l (an 


ыы of the present work), and with E,/G/1. Earlier work by other writers is mentioned in Kendall 
51), and, more recently, there is a comprehensive study in connexion with the processes M/G/1 and 


DIG by Takács (1955). i : і 
À 2 Specific, іп this paper we shall investigate the probability density functions p,,(¢) associated with 
© termination of a в.р. in the small time interval (t, t+ dt) after it began, а total of m customers having 


received Service. Then x 
Pm sr Prl) dt 


is i i i inti 
the Probability that a total ofm customers receive service during a B-P., whatever its length in time, 


(2-2) 
= Ð Palt) 
p) P 


(2:1) 
and 


, 


ы ed. In 
18 the P.D.F. associated with termination of the B.P. in (t,t+dt), however many customers are serv 


Condit; 
Nditions favouring a steady state we shall expect to find that 
d (2:3) 
f p(t) dt = 1; 
0 Td 
Others; ; «o probability will exist of B.P’s. of infinite 
ration, © the integral will be less than unity and a definite рг 


3. Basic THEORY 


0} і Ше event: 
1 dime P.D.F. fmn(t) associated with the following composite | 
s to receive service; 


(і ; i 
Па Customer arrives at ¢ = 0 and immediately begin all interval (6,240); 


(ii) ag 
am urther arrival takes place in the subsequent sma. ueue is n (n > 1); 
(iv) 48 a result of this кейіді the total number waiting 1 ar succession is at the counter (m > 1). 
M at the timo ¢ of this arrival the mth customer to be servo 
D, for» = 1, we have t m=1 5) 45; die: 
fal = | Siu edat tm 
- от=1 (3:2) 


for = olt). 
Ju) = € any customer from 


m 
Th 22, together with in (es RT 
9 explanat; қ vious arrival, in (857 the previous arrival 
i өр nation of (3-1) is that at the time of URS counter. If it уза anes at t. The arrival at t 
Due ж: de Be —1)th iy deme Dum member must be ај 
Қ а queue of т, of whi 
Jum; © Only one waiting. i 
ping dire су to Laplace transforms we obtain (3:3) 
фп(2) = (2), (3:4) 
я mt 2) 
| zi (2) yo» 
фы) = Ел dm? 


еге 5 
Уа) is the Laplace transform of c,()- 
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lizing (3-4) for n > 2, and omitting arguments, we obtain, 
General 


"а (3-5) 
Фа. № Фиги 
т=0 
®,(y,2) = У уф, (2) 
Writing ‚2 


for the generating function of the La 


Тез " а (3-4) 
place transforms Фил, We obtain by addition of (3:3) an 
multiplied by appropriate powers of y 


the infinite difference equation 


(3:6) 
Ф/%2)- ууа) + У; "у (=) Фу, =); 
r21 (3-1) 
= "7 (=) Ф,_1.,„(0,2). 
and for n > 2, ®,(y,2) P 742) Фил (у "Ph 
i i Sti , 
Following the usual method for the solution of this type of difference equation we su 
(3-7), ж" for Ф,. The equation thus becomes (38) 
x = Г(ту,2), 
where 


Г(у,2)- X у”уКг) 


il a(t) exp[—t(z4- (1 —)/b)]at 
0 


(3:9) 
“(2+ (1 — y)/b). 
If £(y,z) are the roots in æ of (3:8) (ғ > 1), 


(3:10) 
Ф002) = УК, у, 2) у, z), 

т 

the numbers К being independent of n. 


Now it is obvious that for |y| <1 


i e £ 
Y| < 1 the summation in (3:10) ean only we cat 
m for R(z) > 0 we find that (3:8) has o das 
and substituting Ф, = KE" forn > 1 into (3:6) we obtain К = y. 
› М S Land R(z) > 0, 
Ф.(у,2) = yEr(y, z). 


(3-1 2) 
4. Тнк LENGTH OF A BUSY PERIOD 
| b; 
Our final objective is to obtain formulae connected with the P.D.F.'s, p, (£) (cf. $2). Weshall denote БУ 
Tm (2) the Laplace transforms of these functions, and our objective will be realized by a formula for 
generating function Il(y,z) of the Tm(Z), i.e. for ) 
441 
Щу,г) = у, у" т„(г). i 
т>1 ісе 
Тһеп Ty, 0) isthe generating function of the Probabilities thegar» 
Й Pm(2-1) that a B.P. concludes after ew 
f the mth i (i кыр 
біле a м while II(1,2) is the L.T. of the p.p y. (0) (2-2) associated with the terminati 
We write ® 39 
A(s) =| a(t) dt, и 
8 :3) 
and d,(t) = Alt) balt). j 
8,(2) will denote the L.T. of the d,, and its Benerating function 
4) 
Az) = у "8, (=) е 
can readily be seen to be given by is 


Aly, =) =|" eH A (5) gp 


5) 
АБА ИШ (45 


Miscellanea 249 
since о 
ў 24464 = (1 аще). 


Now 
ü pi(t) = A(t) e-tjb, (4-6) 
ince t] ар; З А 
tomer D уш that the в.р. terminates in (4,7 4-dt) after the completion of service of the first cus 
; ST У the joint probability that по one arrives in (0,2) and i Я 
is s, ) and that the service of 
completed in (t t-- dt). For т > 2 we have - шаны 


] f!m-1 
Pall) = у b 2 fe-s) d,(t—s)ds, (4-7) 


віпсе, і е 

б at the arrival before t, the (m—r)th customer was at the counter, this same arrival must have 

Mehr queue ofr(1 < r < т- 1), and the whole of this queue plus the customer at the counter must be 
of in the remaining time interval, the last in (4, 2+ dt), during which time no further arrivals can 


tak т 
ake place, The associated L.T.'s are 
m (z2) = д0(2)/0, (4-8) 
and m-1 
SANUS лг) = 1 У bree) Ole) T 
r=1 
B Р 
Y comparing coefficients of powers of y it will be seen that 
(у, =) = [000(2) + X9 ®,(y,2)]/b 
r> 
= уА(6у,2)/0 
(4-10) 


= y(1— £)/(bz + 1— 80). 


41 5. LIMITING BEHAVIOUR OF II (y, 2) 
$ p TNR 
AA (4-10) has only been proved under the conditions RS land №12 > 0, its validity can be 
‘ee include z = 0 if |y| < 1, and, in circumstances to be investigated, if y = 1. In point of fact 
А / i former since it is the generating function of the 


Y, 0) and I 1 : 
ili «a 5 ji rtance; the nction o 
р А овса Pm enim ын: ih in terms of customers served; the latter, since it is the 
ability termination in finite time. Moreover, the value N of 
аП(у. “| | 
is th dy у=1 
е aver, | 
ete Dumber ор гей during a В.Р. өз” 
We us limiting behaviour of TI(g e) dn НЙ, папару for all values of the congestion index p ja 
"попсе with the study of II(1,2). By (4:10) this is - 
2 T(1,2) = а-9/6+1-9), M 
ero г: | 
e É is the smallest root of в =T(a,2) 2a —a)/b). 
we | 
©ап expanq £ into a Taylor series in 2 We obtain ks 
2) = kaz”, 
£a % " 
is 9 
here " Ж өй 
EE dz” ].-0 
> 1-в-82-0 _, (55) 
П(1,2) = bgp 1 — hy a2 OF) 
e expected in 


By 

У the t 

sted’) Ww ier of Branching Processes it 
өсу Stat, P <1, ky < 1. It is evident t 

48) ? conditions, ЕР 


bini e nm -1, Tt follows t 
Y exis. 0 > 1, ko = 1, and 0, II(1,2) > AV iion 
Xist 0 „апа as z > 0. The expansion of £(z) proceeds in non 


n ін) Sofi i 
ii Wh nfinitely long в.р'в. ith z= 
tegra] Dion P == 1, £ = 1 is a double root of (5:2) his азады ш 
ers, but its leading term is unity ағы aa 


is known that: 


«actly as would bi 
(1,2) > 1, exa 
hat as z > 9 


hat a non-zero proba- 
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Formulae for N can be obtained from 


(5:6) 
Uly, 0) = (1—£) y/(1— £y), 

here £ is the smallest root in x of x = a[(1—zy)[b]. 
У. 


ing i e imate, we expan о а Serie: owers of (1 =y); viz. 
o bi iti pand & into a series of ascending р! 
legitimate, 
Assuming it t 


E= X -yy, 
т>0 


а c [e] 
where im T dy” |. 


Differentiating (5-6) and letting y + 1 we obtain forp <1 


(5-7) 
М = (1- h). 


ly 
i t we refer on. 
The limiting process is a little more complicated when p > 1. On the understanding tha‘ 

to B.P's. of finite duration we obtain 


2 (5:8) 
N = (hy—hy) рі. 


6:2) 
Ё = e-ü-£ylp, ( 
This can be expanded into the series 5) 
=? у, (рем) (6 
Uns n(n)? of 
є ; iation 
for instance, by assuming a Taylor expansion and calculating the coefficients by direct differentia 
(6:2). The result is quoted in Takács (1955). 
If (6:3) be substituted іш 


Powers of y. For instanc 


f the 
erved to be picked out as the coefficients © 
е 
ы (6:9 
р =1- е-ір, 
(6:5) 
Pa = есір е-ро 1р), ) 
6:6 
2з = Р Цр) сиу о/р 4. 8/952, | 1) 
6 
Pa = e14 215. 3/2p) — 61 3/54. 4/p? + 8/3p3), ) 
6:8 
Ре 4-3] арз 8/3p*) — е-ро +4 [р 15/2p? + 95/355 4 125/24p^). | 
These formulae can be used for all values of р. 
For the evaluation of (5:8) we have А | (6:9) 
= 
10) 
апа №, = (3p — 2)/2(5 — уз, qe 
8 
Inthe following Table 1 We give values of the first 6, ; i two case 
M/M/1 and рүм, together with = Ve Pm for a selection of values of p in the 
со 
Ур», Ұл, and М. 
Tt will be noticed thatregularizin arrivals has t] Қ ter B ә» 
ее in the single instane, a Е БСҮ T En Te о Increasing the probabilities of shorte ip. is 
greater for the regular than for the random input. Е average n 


Е ite B 
‘umber served during a finite 
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Table 1 егісі uan associated. with the busy period for M 1/1 and Г/М/1 
- Numerical 
q t 
tities associat ith th l; f 
y period fo; M/M] a 
Р JM] 


p 
Di 
Ps 
= Ps 
Ps 5 
Ps 2. 
Ion | D N 
0:95 | 
0-800 M/M, 
0-50 0 | 01280 M 
1-00 "9062 1489. | pi 0-0164 | 0.0073 | 0 
1-50 5000 | +1250 ave | qus | ue 3027 | 10000 | 1:3333 
-4000 | -0960 | pod :0391 2 “ 1:00 
COT | 0960 d -02 hc 00 2. 
d | | ов | Т P: 
- `9 :6667 
3-000 
0-25 
0-98 Т/М| 
0-50 17 | 001 5 
160 | ыт | ово | оов | oona | 00000 | 100 
1-80 6391 | -0973 -0277 | -0124 | -0063 | 9 10000 | 10114 
-4866 a 0465 -0287 4 914 | 1:0000 | 125 
Бе. 0741 .033 0199 +8245 1 550 
4 :0190 0194 зге ШП © 
5 “6667 | 45000 
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INTRODUCTION 


vels only are arly sim) 


Facto 
e regarde 


ria] 
аге wi Periments involvi 
volving factors at two le 


to two Widel à 
Prepari or Uh Аы in consequence. Because 4- or 
Қысты designs n at two levels, these сап be inco! 
ты ооа, many factor such experiments is straightforwar 
2. ito 8 With facto, 3 the search for suitable designs may requi 
© e dis to 256 tr rs at 2, 4, and 8 levels are give? in this paper 
ni bd ты combinations, and use із made of frac 
om Of error, Қыш It is assumed that second and higher-orde 
bet o, 16 possib] for confounding (Fisher, 1935; Finney, 1947), Р 
e, first-order interactions also. 
ion of lower orde 
ictions stated), 


Th T first 
b, ө t; -order i Г 
S fh ables hehe NU has as alias an interact 
en made as complete as possible (subject to * 
which have not been 
; e.g. Yates, 1937; 


Вед, во 
B Teh, те of th x 
ED log, few or їс шн can be improved by modifications 2 
» Kelley & ы are essentially similar to ғ workers; 
oraine, 1948; Cochran & Cox, 195 nt of Commerce, 


rporate 
d (Fisher, 1926, 1935; 

re some time. A number of designs for 
in Tables 1-10. 
tional replication ( 
r interactions can be used 
ut main effects are unconfounded 
fractional replication, no main 


г than the second. 
but it may well 


US. Departme 
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EXPLANATION OF THE TABLES 


The standard notation develop y i in effects and 
i i i 1 letters denoting the maine 
А loped by Fisher (1935) is used, capital e € 
interactions, and те all letters the treatment levels. Where а scheme inv olves EE der e = ia 
S: fir * nd allocated the first letters of the alphabet (АВС for the first S-level factor, DE 
itten first aj d the fir: DEF for the 


d о Vi ors are writtei 8 zai a letters; the 
secon Е level fact itten next, and each is allocated two of the first available le 
ur- 
үа letters (with 7 and О omitted) represent 2-level factors. 
Te f 


" ese give the 

heme is given a reference number consisting of five digits. The first three of т ы iia 

ен 8. level. i devel and 2-level factors, respectively. Fractions of a replicate are indie ord 

a регаты the number of blocks by the fifth digit n; the scheme isa (3) a eec d em 

деш ез rtm brackets are used in the scheme number; e.g. 10(11)63 indicates a scheme for 

factor and eleven 2-level factors, and is a 33 replicate in 8 blocks. 3 е айы азаннан Б 
All the schemes іп any one table have the same number of (equivalent) 2-level factors, 


i ions nded, the 
gives the scheme number, the generators of the defining contrasts and of the interactions confour 
generators of the principal 


в of the 
1 block, the generators of the multipliers which operate on the members o 
principal block to give the 


P isa eror. In 
remaining blocks, and the degrees of freedom available for nat m 
several of the schemes one or more first-order interactions are confounded with blocks; these are 
‘The statistical analysis of the data from an experi i 


are such that the last let; 


1946; Yates, 1937). All the schemes given 
which 6 are to be omitte: 


2 of 
this purpose; thus, scheme 10(11)63 has 14 letters 
d, leaving ABCDEFGH. 


iation 
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Tables 1-10 
aa | А 
Table 1. Designs of type 95 
Sch 1 Generators of Multipli .F. lst-order 
me ig Generators of interactions Generators for | ` E pes is interaBtions 
er | defining contrasts confounded principal block Бізше aro confounded 
| with blocks 
Ра | 
Full-replicate designs: 32 treatment combinations 
00502 5 13 A 
01302 } — | ACE, BDE abe,ac,bd | a,b ( 10 m 
02101 | | 8 - 
10201 ) = | ABCDE ab, ac,ad,ae | а { 6 - 
MN ul | 
Table 2. Designs of type 2° 
Full-renli М ее 
00803 "eplicate designs: 64 treatment combinations - = 
140; | 31 >= 
02203 = ABCD, ABEF, acf, ade, a,b, c 27 | ABCD 
93003 ACE bce 23 DC 
02202 30 = 
0 | 24 =: 
10302 | ad bf,ce | ab 20 uh 
11195 == ACDE, ABDF abo 19 ACDE 
Eu ol | = 
На 
-Tenh | ue 
00613 °Рсаде designs: 32 treatment combinations ab, ac, ае 6 AB, CD, EF 
ABODEp ACE, BDE, BOF) abed, abef , 
906) 2 : i , 8 ed 
0 ib, ac { C. 
tag }аворвр АСР, ВСЕ ad, bf, асе a З 
um | ТЕ Е 
1 C, ib 5 774 
411 }аворвр Amm се а 
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Scheme 
number 


Generators of 
defining contrasts 


Generators of 


00704 
01504 
02304 


01503 
02303 
03103 


10403 
11203 
12003 


11202 
12002 
20102 


20101 


Full-replicate designs: 1 


E 
| 
v 
| 


00713 

01513 ABODE 
02313 aa 
01512 

09319 )4nopzpo 
10412 ABODEFG 


4 treatment, combin; 


ABCD, ABER, 
ACEG 


АСЕ, ABCP 


ADE, вер 


Table 3. Designs of type 27 


ad, ae 
af, ag 
Half-replicate designs: 6 


ations 


abed, abef, 
aceg 


ac, bf, dg, 
abde 


ad, bf, сд, 
defg 


3-4 1st-order 
1 D.F. Е 
interactions Generators for pop at for pone РР 
confounded prineipal block blacks, | error confounde 
with blocks 
28 treatment combinations 
84 2% 
ABCD, АВЕР abcd, abef, a, b, 80 ABG 
ACEG, ABG aceg с,е 76 ава, CDG, 
ABCD 
1 = 
ADEF, BCDE, abd, ace, a, b, = = 
BDFG afg, beg c 74 — 
75 = 
ADE, BDF, ae, bf, cg, a, C, DE 
ODG defi d m at 
y 58 | ADE, В 
70 T 
ACDF, BEFG aos a Da; ab 60 — 
9 og 47 | ACDF 
ABCDEFG ab, ac, a 48 — 
I NN | el 
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Table 4. Designs of type 28 


Generators: of Multipliers | n.r Ist-ord 

Scheme Generators of interactions Generators for E xm = oe : "d md 
number d В inoipal block or other r interactions 
efining contrasts confounded principal blo blocks бутот confounded. 


with blocks 


Pull-replicate designs: 256 treatment combinations 
abcd, abef, a, b, 


00804 
ABOD, ABEF, ( 
01804 ) = ABGH, АСЕН | абдћ, acfh с, 198 = 
09 191 == 
02304 ACH, AEG, ace, bch, a, b, ( 183 = 
“| 04004 = m ау, afgh ea | i 
10504 183 -Ж 
11304 Bre - 174 |ABDE 
12104 BEA, DIa ad, beg, cfo, » b, 164 |BFG, ABDE 
2020 = АВБЕ, aceh 6e 149 |ABDE, АСРР, ВСЕР 
21004 ACDF 137 |ABDE, ACDF, BOEF, 
‘ EFGH,BOGH 
m d ВЕН, BODF, ae, ag, bf, cà, | mb { 170 ЕЕ 
03 } — d abch 
ABEFG 
20202 \ ABDEG. ad, в ET ( d n 
02 |j — : fh, de 
ACDFH 
tog Al replicate designs: 128 treatment combinations "T " 
3 — 
ДІ MIHI 
3413 bdg ab, ас, | = 
DG, | af am 63 
08213 Аворвран | АВОР, АВ abeh, afgh ue ABCF 
1051 АРЕЁ 64 
uas 55 |ABOF, BODE 
1051 67 = 
2 — 
lig af, bg, cd, ab, ac { 57 
М ) 4BODEFPGH | АВЕО, ACDF eh, defg gcc 
| Quang ‘ 
| 00823 P Teplicate designs: 64 treatment combinations 24 ж 
01695 ACDEG. ach, bdg, ae, ag | 18 
m BODPH ACF, AEH deh, abef 
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Table 5. Designs of type 29 
Generators of Multipliers | p.r. Ist-order 
Scheme | Generators of interactions Generators si for other for oe Mens 
number | defining contrasts ere a principal block ЕН error confoun: 
wit! ос! 
Half-replicate designs: 256 treatment combinations 
00914 р en 
01714 ; 
ADH, ABJ abde, achj ab, ac, — 
02514 ABCDEFGHJ d а Р. d 180 
03314 BDF, BEG adf j, aegh ad, cf 171 — 
04114 162 CGH 
04113 ABCDEFGHJ ADH, AEJ, abde, achj, adf j, ab, ac, 169 E 
BEG aegh, cf ad 
10614 170 = 
ADJ, BDF. abdg, adfh ab, ac. 2 
11414 | -ABODEFGHJ Ч ахат nic. 160 | ACDE 
12214 | CFH, ACFG bcef, bdej ad, ae 150 ACDE, ACFG, 
DEFG 
11413 ) AEFG, CDEF b T 
ABODEFGHJ , b abcf, abde, abgh, ab, ac, 167 
12213 ABDEJ acdg, bj ad 155 = 
20312 | ABCDEPGHJ 4€DG,BOEH | ad, ag, be, bh, ab, ac 141 = 
fi, abef 
Quarter-replicate designs: 198 treatment combinations 
00923 ме 
01723 || раған, ДЕ 
02593 ворка; eo BHJ, beh, СГА af, ah, 68 = 
03323 achj, adef bc 60 
51 v 
10623 ABDEFH, s 
BCOFGHJ 48 "i бин, bs bdg, ad, ae, 51 | FJ 
10622 | ABDEFH, | 4 EF. 2% 0080 gj 
BCOFGHJ ega E д bdg, саһ, ad, ae 54 s 
2, 9) 


baana — 
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Table 6. Designs of type 210 


Se] Ge tors of Та р 
ine Generators of intéesakians Generators for | МАЁР + ub T вы 
г | Че ing contrasts confounded principal block for other xe ee 
with bloda blocks error confounded 
Quar е 
Arter-replicato designs: 256 treatment combinations 
185 = 
p" қ 177 - 
EGHJ, ACE, BCG, bcej, Уа), ad, ae 168 — 
BCFGHK ADK, AGH df jk, abchk aj, bf, 158 = 
148 AGH 
(Ерен) ‚ (СЕРУ, ОРИК, | миле аба, ( T = 
BOFGJK ACHJ bojk, cegh ah 148 = 
164 = 
ACDGHJ, ADF, BFH, сј, вд), adeb, | аф а } | 159 — 
BOEPJK BDK aefh, bghk ah 139 — 
Teplicate designs: 128 treatment combinations = 
AEFGJ, r ae, ck, = == 
BDGH, ACE, CDK, fg, аса), ^d р 
J, hj, ас dh 49 ерк 
BOPIK EHK aefhj, 
г 40 La 
ABD OPH, ADF, ВЕЛ аа, bej, са), bf, cj 
BOFGH;, egh, af jk 
4Воррјк 
Biom. 46 


00(12)44 
01(10)44 | |4СЕНК, 
02844 | | BCFGJ, 
03644 || DGJKL, 
04444 |] FEJEM 
00(12)43 
oaoa | |405НК. 
09843 
un DGJKL, 
2% FHJLM 
ACDEFHK, 
10944 || BODEFGJ, 
11744 CDEJKM, 
CDFGHL 
ACDEFHK, 
10943 BCDEFGJ, 
11743 CDEJKM, 


Table 8. Designs of type 91 


Sixteenth replicate designs: 256 tre: 


CDFGHL | 


atment combinations 


AEL, AGJ, acegk, adefj, 
BHL, LM abofjtm, 
abcdef gh 
AEL, AGJ. gjm, bohm, 
BHL acegk, 
adefj, abefgl 
pu CDF. bjm, aeghk, 
, BJ cfhkl, edeghm 
ADG, CDF bim, ehjl, 
CGK aeghk, 
МА, cdeghm 
— e не 


Miscellanea 
258 
Table 7. Designs of type 24 
Generators of Р A Multipliers | D.F. р м. 
f interactions enerators for for.other for inter: 
Scheme | Generators o d incipal block :onfounded 
ing contrasts confounde princip blocks error | со 
number |деһши@ 0 with blocks 
—— НЕН 
Eighth replicate designs: 256 treatment combinations 
ACDEL, ADH, BCE. ahjl, acdeg ae, Uh, 174 = 
0001034 | | popu vum VI: cefile 5 = 
> E. befgl, k , abd 165 
01934 OFGJKL BDG, DEK cfgl, се) Ja 
02734 | ACGJK, FK, CEF, ahjl, abdgl, aj, bd, 156 | FK 
EGHKL, DEG, EHJ acegj, def jk bf, acd 
ABDEFGJ " 
03534 | ABCFKL, ACE, ADH, седі, acfhk, bl, ck, 145 | ABK 
ACEHJK, AFJ, ВЕН bdghk,abchjl| dg, ej 
ADEFGJ 
01933 А 173 = 
02733 | |4FHJK, | BBL, BFK, UAR BID. aj, bd, | 163 = 
03533 ВРЕТ, а ny aaah, ce 152 = 
08238 || OEGHKL af hk, abcef | I = 
10833 ae ADF, BEG, af, bghj, арм, | bk, oj, { 149 = 
11633 ACDFGJ CDL egjk, chjkl dg 136 = 
Sixteenth replicate designs: 128 treatment combinations 
ACDEH, 
00(11)43 | | BODFJ ACJ, АЕЕ adehj, j j = 
d , , ij, bcef j, acj, ad f, 54 
01943 ABODGL, BOG hik = 
BOEGJK фу Bed 45 


Ex 1 — | || - 


163 | 
ae, abc, 153 
adk, м | (2 
gim 130 
118 LM, GHK 
170 5 
ае, абс, 100 Бе 
edi 149 pe 
137 
194 m 
BJ, Еб, РЇ’ 
cd, cel, 129 | ACH 
ыл EG, ГІ» 
ul 116 p on, ADE 
ed, cel, || 133 T 
acjl { 119 = 
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Table 9. Designs of type 213 
Generators of ipli e: 1m 
Scheme Generators of interactions Generators for perd es Neo sl 
mber | defining contrasts conta principal block sale Namen Ud 
with block: % j 
Ht 
Thirty-second replicate designs: 256 treatment combinations е 
ACEGK. | 
00(13)54 \ j, bdeghl. acn, bjn 149 — 
ADHJN, acefgj, bdeght, , jn, 
0101154 ABCOTA ACM, ADG, cefhimn, dfn, 138 -- 
02064 || acperm, | 4Fb.AFN dfghjkn ав ^ | 126 т; 
ACHKLM 
ACEGJ, i 
03753 BDEKN, adgk, bafm, acl, cek, ( 121 M 
04553 |] Бором, |402, ВОР, ehjk, d 108 |GHK 
ABOPHK, | ВСЕЕН cdfjn, cghlm 
ABDHJL 
10(10)53 ADHMN ADE, AFH, bden, clmn, алыш, 117 |GJ 
CEGKN, ВЕР gikl, ақ 
BODEEN. def mn, 
BOGHJN, abcdf ll 
BCKLMN 
Table 10. Designs of type 24 
Sixty-fourth replicate designs: 256 treatment combinations A 
0014963 ACEHJ, Ee 
01012) 6з BCFGM, achjm, dehin, | cfh,acfe. | | 119 | VGP 
(1064 | | ABDFHK, | ACF, ADH, glnnp, abdfin, | “ар ne 
03863 FGHJKL, GP of jkin 90 
04663 JKLMNP, 
1011 A cfghn сір, hmn, | 99 m, 
368 | ADEFG, ADP, BDK, p Todi, | afkm 
AGLMN, JM ifhkp, 
BDEJP, egjlmn 
BEFHM, 
ODFKP, 
ABDEKL 
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Confidence limits in the case of the $eometric distribution 


Ву KERMIT С. CLEMANS 
U.S. Naval Ordnance Test Station, China Lake, California 


INTRODUCTION 


1 rials before 
is paper presents T ence Бе! г imati ber of trials 

рар ting т, the expected num 

Thi ts central confidence belts for estima je: 6xpet 2 

failure, when umber of trials before failure, has a geometric distribution 

ailure, when X, the numb: 


Р(Х-4)-рч (b-0,12,..) 


where p and q are positive constants with P+q = 1. The expected value of X 


E(X) = plg im 
hence p = m/(1+m) and q = 1/(1+m). 


; ceptance 
This model is of interest in estimating the mean life of ordnance devices that have passed accept 
tests and have an operating life th: 


s X 
m= у, XN. 
ізі 


N 
Tt is further known that У, X, has a Pascal distribution (Feller, 1950, 81153), 
ізі 


N t 
Pe( àx-s- Wis ose (k= 0,1,2,...), 
i-1 


where р = m[(m-- 1) and 4-1/(т--1). The distribut; 
and m(N— 1), E(Xx, 


) = рід = Nm, 
var (%) = mm + 1)/N and 


that is 


ion is bimodal w 


1 
- C 
ith modes for £ X, = m(N 1) 
and var(EX,- Np[q? 


PIS = Ms 
=Nm(m+1). Then Е(т) 


coefficient of variation of — a ex a) oc 3s 1 
m 
The cumulative Pascal distribution is 
N k P i N 
қы-»(% X,«1) s [^ ү а жа 
ісі j=0 j т+1] \т+1 
Consrruction Амр USE OF CONFIDENCE BELTS 6 
The belts of 1 — æ confid, i SM : ndin 
values k, and k, aes TEENS SEE are determined by assigning m and then finding correspo 
T() < а < Е(к, +1) 
and T) «1-345 < F 
For the pur 


ng curves a hk’ f, r inte 

polation between k; and k;+1 Бай the points em mal ( +; h me ie c cei b» 

The summation was done by the ІВМ 704 Computer and the pro, oe и 5. =: lable. Belts 
different confidence for the Sample sizes used in this article are н ен fi m Charts Ta” 1 
give two-sided 90 and 98 % central confidence limits, respective] “ing! aca ber o | 
before failure, as a function of kiN, the observed mean numb; Fia roy the mean num 
Тһе charts also give one-sided 95 and 99% ur tial 

Tn general, these c. 


р fsize 
b i i mples О: 
confidence limits, S before failure in samp 
lts can be used in + 
(1934) or Crow’, 


t 
he same wa $ 5 | fidence bel 
S tables (1956), to estimate Pr ад 5 binomial con 


à : ance 
imits or to plan in adv! 


s of 
the 
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Е M 2 : P. 
10n of the two curves for N = 30 in Chart I an e abscissa k/N = ье. 
t f N 0 in Chart I and the ab: k/N = 500, 385-700. 


INTERPOLATION AND EXTRAPOLATION IN THE CHARTS 


Inter olati O Si 
polation with respi i 
К ect t ample size may i i | 
mean with arguments N-1, = | "i ' н 


Extra 
polated i D А 
t equation 4 confidence limits for the population mean for sample size N > 100 are found from the 


logm = log (k/N) + 10N-4 |log mo) — log (k/N)], 
ere i " 
rc дагы the corresponding (upper or lower) limit for sample size 100 at abscissa k/N. 
Polated population mean confidence belts for large sample means (k/N) are | 
- жт (ХМ) = (1000/a) (k/N) 
с 
м my(N) = (1000/b) (EN), 
"her s 

and o т wes b aro the abscissas at population mean 1000 for the corresponding sample size №; see Tables 1 
log b 719 obtain values of a and 0 for values of N not tabled, interpolate linearly on values of log a and 

› using N-! as argument. 
Table 1. Value from 90% confidence belt 


4 5 10 15 20 30 50 


34141 393-6 542.3 616-2 662-6 719-7 779:2 8414 
1460 1395 1320 1245 1173 


1940 1832 1572 
2.42 1:99 1:21 0-938 0-793 0-632 0-480 0:336 


Table 2. Values from 98 % con, idence belt 


20 


700-5 782-1 1000 
1366 1259 1000 
0-696 0:484 0 


4197 498-2 553-9 
1883 1702 |1598 
1-42 1:18 


05-3 | 2554 
16 2326 
447 3-49 1:89 


% confidence interval for the 


Br 90 
ian essary to em? mao the magnitude of the distance 


Ple 3. Find the size of sample nec 


ns 


. This mea 
he sample mean 0-500 of that abscissa. Find c in 


Populat; 
10n mean is -50 of t 
is not longer than 0:50 scissa should be c 


Acros 
Таы ыы % confidence belt measured along an а 
61, where 11 

c = 1000 (-5). 


belt. А ү 
vs estimate of the sample size required may be 


anda 
and b aro the abscissas at m = 1000 of the confiden 
ments logarithms of c. 


foun osest value isc = 0-480 for № = 50. A more ассига 
Y interpolating between values of N -і, using as argu 
log (0-500) — log (0480) _ 9.148390, 
log (0-632) — log (0-480) 
50-% = 0-041153, 


30-%— 
= 0:147528. 
This зра = 50-1+ (0-041158) (0-148390) = 01 ken as 46 
v ize is take . 
alue of y- is between 45-3 and 46-3; hence the sample st 173 
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Note regarding Statistical Research Monographs 


я eries of 
'The Institute of Mathematical Statistics and the University of Chicago have established a ser 
ications entitled Statistical Research Monographs. у — 10 
us primary purpose of this series is to provide a medium of publication for material ке aie 
statisticians that is not ordinarily provided for by existing media. It will help fill the gap betw 
articles and textbooks or treatises. Among the kinds of publications envisaged are: “il һауе ample 
New research results too lengthy for the usual journal article. In particular, authors will he 
scope for detailed exposition of their findings. wg of such 
Research results of interest in both theoretical and applied statistics, At present authors ne 
material frequently find it necessary to publish part of their results in a theoretical journal and par 
applied journal. 


Expository monographs in particular areas of statistics. 
Discussions of statistical 


problems and techniques in particular areas of application. 
Every attempt will be m; 
The Editorial Board co 


ade to maintain the highest standards of scholarship. ү Cochran 
nsists of David Blackwell (University of California), William 1с {ме к= 
(Harvard University), Henry E. Daniels (University of Birmingham), Leo A. Goodman (U тт ‘anc 
Chicago), Wassily Hoeffding (University of North Carolina), Jack C. Kiefer (Cornell University 
William H. Kruskal (University of Chicago). 
Authors are invited to send manuscripts and 
Department of Statistics, 


" {mans 
correspondence concerning the series to Leo A. Gooc 
University of Chicago, Chieago 37, Illinois. 
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REVIEWS 


The Logi 
e и : 
of Scientific Discovery. By Kart В. Роррев. London: Hutchinson and Co 
: ап . 


(Publishers) Ltd. 1959. Pp. 480. 50s. 


Prof. P 

+ Popper's Logik der 

à most impo gik der Forschung was first published in 1934. ў 

T t М . It was 

statisticians a book and has had a profound effect on аа ори 

оп Scientific Me wie perhaps those who have been lucky enough to be cuts to a EE x ааш m 
Method at the London School of Economies. Rather surprisingly, it = m RUE 

> previously 


be encouraged to tackle the original work. 
under which the study of a mass of factual 


0816’. He is. ; Ё 
He is, in this book, mainly interested I © ve testing of a theory or а hypothesis. 
nly if it is capable of being tested by experience. Thus the 
hat it is testable or falsifiable. 
when it is, Popper says that, of two 
But this is а kind of probability, 
bility to the statistician and is not to 
statement forb. 
© smaller і чен 
Vents, ler its absolute logical probability because, to put it shortly, 
, Po : 
inte; олы then equates the degree of  falsifiability to the simplicity ofa hypothesis, and has some very 
ing comments on the concept of simplicity itself. Thus he arrives at the schema 
robability = paucity of parameters = simplicity. 
rinch, who effectively would have 
paucity of parameters = simplicity. 
o be heading for trouble. For if the para- 
bability of any particular hypo- 
t up his method of approach to 
фа hypothesis has а non-zero 
has no value or validity in 
but with their degree 
and to increase 


"This ; testability = high prior imp 

is 
to be compared with Jeffreys and У" 
high prior probability — 


At this ро; 

t eterg i үне as he himself recognizes. Ро 
hesig is ze hypothesis may take an infinite ве! 
Prior prob хор and, in fact, it was to avoid this difficulty t д 

Probaj it ability. But Popper has an answer to this. The «coi 
cience ИУ із not itself testable and, whatever its metaphysica! У 5 
" » whate YO eh the probability of ee oh 
its qa бота | Р ‚ to test it more and по е, 
8 de ton. It is possible to go on learnit pe ol pro hability, which м с. - 
ene poration (Bewährung): de AS У Mite 
-L tion, but this expression he now discards as mis- 
сопйттайон including Carnap). Popper distinguishes 
elation to statement, 


itics, In NUM 
в of bis wing: ical probability” in relati 2 
су interpretation of 


pper seems t i 
t of values the prior pro 
hat Jeffreys 8e 
on tha 


In earlier 


be, ПВ (and, ; 
atten тру s fact, as having misled som Шо 
obability' in the sense of relative frequency: | “4 relative frequen 
as accepted о formulating his propensity theory. Within 
by f two frequential problems: 


Drop, СОГ” i 
ОБ, оао '. In statistics he has 807 wpoin 
(а) ате, ough he has since modified аи to dispose 9 ifyi 
) the Work of frequency theory he has n the «e tan nderstand it, by modifying the field of 
this һе 4068» 


robi 
lem of the random sequence; 


266 Reviews 


ti bout 
ing i - oblem that no assertion a! 
possible tests for randomness by making = еа 5 ed qm end ee cine uis he 
ility i again puttin ания = 
Е сална iá pe cm eue s eh d'Alembert's, that events of small probability simp'y 
overcomes by an 
5 in Nature. . E auch the 
x Ven и ое. him, he can proceed to а theory of corroboration. It туга T а ефе 
ber ing i i i f corroboration as the severity | 
tances which determines the degree of со! з simpler 
"sdb per den i i the falsifiability; and hence the 
5 . The greater the possible severity, the greater the ! yi S linge ү 
Heidi аа more ate she The degree of corroboration of x by y is then not the eim en 
Ур БАБУ of z given y; and in fact it is possible to construct examples in which 2, is strongly $ 
р у and =, is strongly undermined by it, but for which Р(а,|у) < P(v,| y). m —" original 
Pesci, the degree of corroboration is defined as a function of probabilities. Pop] Б 


sistent 
method of doing so may, perhaps, require modification, but at least he showed that a con! 
system obeying his axioms is possible. He thus measur 


рев 
es the rationality of accepting the айева ЭН 
generated the hypothesis, and this seems to be about as far as numerical evaluation of the accep 

Ға theory can go. стат, 
° The bosk um а good deal of technical matter which will be mainly of interest to the bene 
and some replies to criticism which will interest only those who are aware of the work of ше un thi 
critics. Moreover, a final judgement of the System must await the sequel volume. Nonetheless. 
book remains an outstanding contribution to 


zai. ааа tne: 
the problem of the utilization of probabilistic ideas 1n 
evaluation of hypotheses and will repay detaile 


xu Lb 
d study. м. G. KENDA 


a th 
Experimental Designs in Industry. [Symposium held November 1956 at Nor 
Carolina State College.] Edited by V. 


Crew. New York: John Wiley and Sons Ine. 
London: Chapman and Hall Ltd. 1958. Pp. xi+268, 485. 


and response surface design; а 
This is a book wh: 
learn Something of the 
book (typescript with a number of surpris 
than is justified. ть is also quite inexpens 


1 "T 
ishing 
ous students of statistics and others Wish"? g 


ро 
ental design, It is an unprete” oot 


N 


| 
| 


з 
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Analysi i i i 
45. pecu Time-Series. [No. 1 of Griffin's Statistical Monographs and 
s.] By М.Н. QuzwouiLLe. London: Charles Griffin and Co. ud dub 


Рр. 105. 24s. 
This is tho first in a seri 
K series of monographs on specialized topics whi i i 
T zone was reviewed in the last imus of this rela jen ані 
" ans i s book ш remarkable in that much of what it contains is new work developed over 
enim н а t period, The author is concerned with an extension to stationary multiple series of the 
Seoul es pa porties, which are well known in the case of single series and also makes a start with the 
‘Sante, о problems, confining himself almost entirely to large sample theory. Specifically. 
Жашы vhich are considered are linear of the form H(D)X, = G(D)e, where H(D) and G(D) pm 
“екенін T elements are algebraic functions of the shift operator D, X, is a vector of p elements 
Жалан ше xd he simultaneous values of p series at time d and e, is a p x 00 matrix of uncorrelated 
Sitting ih х ending backwards in time. One would have liked to see at the outset a statement con- 
Fw e uniqueness of this representation in the spirit of Wold's theorem for single series. 
ле ease where G(D) =I, the unit matrix, the correlation properties of the X, are determined by 


the r Ü 
roots of the determinantal equation |H(D)| = 0 and these may be real or imaginary in pairs. It is 
d if a transformation is made to a canonical form in which 


Su vs much simplification is obtaine 
ТЫШ T d agonal with elements which are the factors of |H(D)| =0. In the сазе where these factors аге 
iiec e transformed model is that of a multivariate Markoff scheme viz. AXI AXE, = ей and 
Cha re given special treatment in what follows. | А 
iinet 8 is concerned with the problem of Identification, namely, that the parameters of the scheme 
salia Us uniquely determined from the covariance structure. This is true even for the model cited 
seii т, but the situation is made more complex by the effect of superposed error. In the case of a single 
Ci 8 this is equivalent to mixing the series with an independent random series of unknown variance. 
early, one could make the situation more hopeless by adding an autocorrelated series with unknown 
that the two series could not be separated 


autoe A 4 
nb aera structure, in which case it would follow 
hout further knowledge or specification. А 2 
Chapter 4 deals with inference problems and the methods which are developed are illustrated by 
ies. Chapter 5 is devoted to miscellaneous topics and is 


m 2-2 : 
Means of artificially generated multiple ser! 0 n 
y be complicated in practice due to the omission of 
superposed error, non-linearities, trends 


ры to demonstrate how the analysis ma 
vant variables, moving average residuals (1.0. G(D) € D. zor NEA 
and short series of observations. A very interesting method of trend analysis is proposed in which the 
differences of the correlations rather than those 0 iginal series are used. Next follows chapters 
y a last chapter in which a practical example (five 
eveloped earlier. 


O; Ke . 
n estimation and goodness-of-fit tests and finall ; 
to the various techniques d lier 
ground, but it is necessary to draw 
f criteria of the form C,C,_,, 


TUR economie series) is subjected to t bna 
attenti ии aoc jalopy "€ exp orally. Thus the use 0} 
10n ] i d uneritic . , vt 
Where С, is ds мең їн n akioga. ipi 06 езі to lag 8, аз а means of deciding on the order of the 
d covariance matrix corresp ago residuals. In the case of a single series these 
e yond the kth for a kth order 
The chapter on estimation 


Sch $ i 
m i me is clearly unsatisfactory for moving ave ево are zero be 
atrix quotients are the partial serial correlations and = imr seris 
x iduals are mi д Р ; 
if the residu "er serial correlations form a sufficient 


autore " 

gressive scheme but never zero 

COntaj. ies А shat the low 2 ae 

ins several surprising statements, for example, pueden scheme. Again, it is surprising that the 
ters of a moving ace, despite the fact that in 


Set for t} : 
пе estimati ; i 
Dion ofthe paame ial correlations is given such little sp: 1 t у 
ial cor ficance tests and estimation. The relations 
but not very useful since 


effec 
emai] of mean corrections on the ser рі 
Samples they may have considerable E 8-50 are interesting, 
s given on РР- idet continuous" scheme governed by a speci- 
ensure that there are no 


etw 
Fei un continuous and discrete schemes ipee 
Пед rarely іп a position to say that one 18 sampling Iveis it is necessary to 
oscil, tochastic differential equation—in spectral ana У P liminary filtering where necessary. 

ations between the sampling points, using some pr ҮНӨ арргоаеһ to time-series which is pursued 
Шала d to the author's interest in particular 
this it is possible in 


к point which may occur to other readers F ere 
Subj 8 monograph is entirely empirical—this 18 no do ossible. In contrast to ae ior 
e cce ne matter fields where no other approach e қ uch away that the problem of identification 
. “rical Н В көрү ify the X,8nd € 77. e :ngful or else as an explorator; 
арада “aie ый applications to spe ita itis physically zone у 
- Also, a linear mode ; i 
ook before more realistic non-linear modals ns We overall ө Шел of Yt p m m 
Weve tioi 3 ‚ау detract ir МЕ Һойз. It is no redic 
can we sana wy Seen а 
at it wig beans m standi book worthy of serious study 
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raphs 
iriffin's Statistical Monograp 
isti i No. 3 of Griffin’s Statis 

ls of Statistical Reasoning. [N ffi 

Fundamenta 


958. 
iri 1 Со. Ltd. 19 

v E don: Charles Griffin and 

М.Н. QuENoviLLE. Lon 

nd Courses.] Ву 

Pp. 169. 245. 


i will 
atistical calculations 

4 i f how to carry out statistica 4 jr aln 
Я a step-by-step explanation о! statistic ulation | 
‘The practical = а эта find proofs of the formulae inv oem n det 
not find it po Cuban however, be interested in the DE sy уе — Tee 

nier acti ; is aimed at him.’ (From the вов) 
procedure: or practices, this monograph is aime | | hors рге 
rather n ы Sino present reviewer was somewhat disappointed to find the 

After this in! ion, 


tical 

positi the mathematice 

к 7 А voted to ап exposition of car 

M EM Methods) have been devo: arson and Ban 

parts е ре енең theory, such as is covered by the first part of apu tM a 

concepts dice in Research. There is very little discussion of the relation о ho eee р 

ннан practice іп the real world. The treatment is superficial and in parts inaccurate. 
Б 


"ase 
a in the phras! 
t suffice: thus ‘independence’ (of events) is never defined, and the word is first used in 
mus! H п 
‘independent exhaustive possible 


A great improvement is а) 


develop his line of approach. 


d 
" " 4 timulate 
Professional statisticians (who should be both mathematicians and practical men) will nee men are 
by this final part, and plain mathematicians may be intrigued; but Mr Quenouille’s prac 

likely still to require сопу 


ir problems. 
incing that the theory outlined has any relevance to their problem 


/8 
с. 1. MALLOW 


‚ and 
S NUS д Я : Harper & 
Multivariate Correlational Analysis. By P. Н. DuBois. New York: На P 
Brothers. 1957. Рр. 202. $4.50. jo to 
" itud 
Statisticians of the sophisticated sort are nowadays inclined to take a condesnending on о 
correlation analysis and to attempt instead to operate as far as possible within the fram 
Tegression and variance analysis. Indeed, it is h: 
them if Fisher had ne 


ic 
mat? 
aticians, is to воб out system 


e boo, 
1 esting. Consequently, although he tisti- 
al research workers it is not likely to be very helpful to 8 


regression theory 
author shows ho 
and shows how 


aoe уотатУ 
ties involved. The cus e 


A со! 
i St the one which is most highly ghee! 
In combin 


Reviews 269 


regressio, 
n sum of s 
а Systemati squares, and so on. This thod i mm: i 
hatie computi method is recommended in the book Д 
a e ы ook under re an 
puting procedure is given. As the author points out, however, not E Ns wana 
Ц пеге по 


Buarantee that 1 b: nstruct hi di 
lat the will yi y i) uct cases where 1t ае; 
1 method will yield the best su it i ly 
: E е yield tl set, it is easy to c 7 it definite 
ев not. A defect of the presentation is that no rule is given for advising when to stop adding furth 
g er 


Independent, vari 4 
Where na variables. In this respect the presentation may be com ared i 
stopping 2 of Summerfield and Lubin wegen d 1951). Setup ee = dra 
WES tha я t е contribution to the regression sum of squares, due to including a further a m 
the Банкир estimated residual variance, while Summerfield and Lubin recommend stopping wh z 
guard d e regression sum of squares is non-significant by the F-test. As an E. 
Squares given b s d and Lubin suggest checking that the difference between the regression sum of 
1 by the reduced set of variables and the regression sum of squares given by the complete 


set of ind 
à ependent vari i i 
obvious TA nt variables is small enough to be attributable to chance. These suggestions 
m Р , stio; 
merits and it would have been helpful to have had them discussed. Se 
ful in that it ties it up with the earlier treat- 


Altho 
ment лала account of factor analysis is use: 
thi elation and regression analysis, it seems altogether t i БЕТІНІН 
is mysterious iei gn ysis, i gi r too short to give one a clear insight into 
* J. DURBIN 


Intr А 
Oduction to Statistical Inference. By J. C. В. Lr. Michigan: Edward Bros. Inc. 


Tig 0587. Рр. баз. $7.50. 

е қ 
HL татары state that this is а non-mathematical exposition of the theory of statistics written for 
Necessary fi scientists. The mathematics are in fact kept to & minimum—there is just enough to enable 
theorems ormulae to be written down succinctly and no attempt is made at proofs. The necessary 
In this fa: T stated as articles of dogma and sampling experiments are used to illustrate their validity. 
(E rige Prof. Li covers the distributions of the sample mean and variance, t, F, the variance ratio 
ànalysi n, the one-way classification in the analysis of variance, randomized blocks, linear regression, 

318 of covariance, factorial experiments, transformations and distribution free methods. 

inimum aimed at 


Th 1 
hous = of exposition is remarkably consistent and close to the mathematical mi 
shout. The writing is clear and the ideas are set out with simplicity. The reviewer feels that the 


Pauci 
ean "d of the data of experience may be а handicap to the experimental scientists, since they tend to 
eavily on actual data rather than fictitious. However this is possibly a matter of opinion, and 


Within its lim; 
in its limits the book can be recommended. F. N. DAVID 
Те | а 

ach yourself Statistics. Ву В. GOODMAN. London: English Universities Press Ltd. 


1957. Pp. 238. т. 6d. 
atisties for readers with a min! 
го statistics nan ен 


As its t 
8 title j ; Р 2. т š 
(Ele implies this book is an introduction t И 
mentar : + advanced techniques 
вери: y calculus and determinants are the most a d epa à 
ut o ind are used reference to appropriate texts is often gd Low " Dea atime ry 
Th 9 casual reader will d a desk calculating machine by him 1 he is em himself. 
E 9s field covered is the is е e tm inimi ў presentation of data, measures of frequency distributions, 
sion ite 4 probabilities, binomial, Poisson and univariate and bivariate M imum eie sd. 
T ; a 2” i ion, samplin; 
correlation, rank correlation, least squares, partial correlati pling analysis of а, 


ete, i 
testing, mentary cctimation, the 4- and F-distributions, elementa ates 
L " " » ы Я 
n regression analysis, X?-tests of goodness-of fit and conting' tary texts (Yule and 


Wh; 
Kendall the topics and the order of ргевеп п older ee tae The author 
Sing fae, Prings to mind) the mode of presentation E caret of probability to the 
Praet; ed with the fundamental problem in such @ book, d hardly have been unusual in an 
ele. Ж, Of statistics, has gone back to ап арргоас an т u (pp. 34-35): 
Е ТУ text of the early nineteenth century. For пу probabl 
eir the situation is such that we can distinguish ^ 0 x 
ble outcomes 9^ f tho probability of Æ given C, p(B; С) 


imum of mathematics. 


instance. 
— d е” outcomes of a set of 


"i Ққ ы C and if these are all the possible o oe 
à Biven D Of an event E we say that the appropriate esum 
y жЕ; б): т/п. . 
ances С the event E has occurred m times the 


set of circumst 
f E given Cis 
gun; в) = min? 


(2) 


Oth 

а ү 

горна, түгіне, if in n oceurrences оға 
© estimate of the probability о 
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again, 1 1 е SCO} {150 el zl. e stat n p. 43) 
n he states (о: 
N trials, the score v; is obtained n; times, i = 1 4 
Ғіп А М 
And ag : 


< E 
*E(x)— lim | У, wax | 
Noo Li=1 
(allowing for what must be a misprint). 
al 


% be 
h ; ay $ must 
i F liseussion, 1 

Wh such ments as these are to some extent mitigated by imm yiri ie 7 Norconfuse 

d a m is " re ed are often not only loose but such as seriously anc a 

ж. ents us | к! 
said that the аг, versae 
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in the reader, 

venial and in any case unavoidable. Indeed 
eni: 
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Concise Tables for Statistician 


“otictical Center 
S. By К. C. S. Рилат. Manila: The Statistical 
University of the Philippines, 1957. Рр. 50. $1.50 plus postage. 

This is a book of eighteen short tables on 33 

The tables fall 


Е »ralization 0 
ne sum of roots (table 17) of the matrix generaliz 
ultivariate analysis. In the ] 


and Complex Var 
Publishing Company, 1958. Pp. 4 
This book a reprint, in one volume. 
the theory of functions which 
some others, 
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An Introducti 
uction to Combinatori 
Wiler a orial Analysis. By Jonn R: ү. 
y and Sons Ine. London: Chapman and нші Ltd MSS d уо ы 
d . Pp. x+242. 685. 


The 
great name i i 
Books e in combinatorial analysis 1 
бз GE и i nalysis is undoubtedly tł і 
knitting саш E ingenuity and combinatory insight Үй hohe Em ced 
Work in the seres early work of De Moivre, the generalizations of La een PELA ERAN. 
generality of the инш ы theory. The difficulty, and it is a real опе A ади rom je elementary 
piece of theory heory because although it is all there it is not т у E 

Тһе work of med : — in any given circumstance. PRUDENS SEHEN 
desi к - A. Fisher in the combi i i 

Es Has т ар З mbinatorial field, both іп samplin i i 
mowledge ens MEM that the worker who would тен еа e 
mahon: Р f mbinatorial approach. Previously th у i ux M 
dle ch. | ly there was no alternativ 

ых: ior dan has now put us all in his debt by writing a treatise which wil aa wen hoe 

The cem iae рет and to a certain extent will supplement his work. Nic: 
Sometime overed are permutations and combinations, generating fi i i 

да оп k jing functions, De Moivre’ 
tributed to Boole and often to Whitworth, cycles of permutations, Ее 
P itions, 


compositi 
ions, tree: i 

, trees and networks, and restricted permutations. There is no tie-up with statistical 
Prof. Riordan’s book should, however, be made required re om 


met] : 
Tor on o: probability theory. 
is high ops to work in the field of so-called non-parametric methods. The standard of expositi: 
Which fills a 1B Very easy to learn from. In the opinion of the reviewer this is an excellent es Rs 
gap in the mathematical armour of the statisticians. Аа 
Е. М. DAVID 


Calc 5-4 
ulus of Finite Differences. By GEORGE BOOLE. [Edited by J. Е. Movrrox.] New 


" 

без York: Chelsea Publishing Co. 1958. Рр. viii--336. $4.95. 
қ ii in 1860, when he was professor of mathematics at Queen's College, Cork, wrote an 
“ollege e M on finite differences. After his death his book was revised by J. Moulton of Christ's 
lishing Cor mbridge, in 1872. ‘There is no record of the date of the third edition but the Chelsea Pub- 
This e per note that the present (fourth) edition differs from t i ional changes only. 
Stood the t Ky refreshingly free from the modern trend towards unnecessary mathematical rigour, has 
Sön ek гра of time well. Generations of English mathematicians, statisticians and actuaries, have 
hose who on it, and that it has now crossed the Atlantic can only be a source of satisfaction. For 
Ure, Su. are not familiar with ‘Boole’ the topics covered in the first part are Interpolation, Quadra- 
The dà mmation of Series, Bernoulli numbers, Convergency; Boole's limit for the remainder of a series. 
nd a rage part is devoted mainly to difference equations. There are many examples in each chapter 
of answers is given at the end of the book. To re-read this book after a number of years is 

de in manipulative mathematics. 

] approach is simple 


о 
есо: 
% еее © aware of how little advance has been ma med 
and Р сап be recommended to students of all types and ages. ТЕ чи ш ed PUn ale 
16 amount of A "uae faible. At the price, it is—by icu on 
ал о қ 
"Bain field. prerequisite learning із neg& в. 


А А T 
Treatise on the Analytic Geometry of Three Dimensions. (Seventh edition.) 
Vol. 1. By Gzoncz Зльмом [Edite d by C. E. Rowz.] New York: Chelsea Publishing 

been followed by many 


Co 
+ 1958. Pp. xxiv-r-470. $4.95. 1 
5 ‚ В. A. Fisher has 
stics by R The generalization from 
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deri 'odueti қ А 
ivati ion of the geometrical method in statistic’ | h. 
ihre 5. ала proofs in distribution theory based on this type of АРЕ that it is done at all makes 
ai neces dimensions is not always skilfully оғ accurately eU be nodding acquaintance with three- 
ime, . sity for th ee have at east & 2 Ex. З 
е nt to hav Н iewer’ on there 
ity Tor Ше modern rinde Pr d ia s o gn 
Tug, Why this, the seventh S of Salmon’s book, should be preferre to them. , 
book. ut it was revised in 1911 for the fifth 
have been 


SO b 
© remarked that this is an excellent weds b 
is not state аав; who appears to 
ie sections 


inted. ] 
= wrote his treatise on соп! 
bout that time. The topics covered 


n; апа a 
il By again in 1927 for the sevent 
most rivalling 
1 date from а 


0* Е 
8 so presumably this present work wil 
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i i i trical interpretation of e 
-ordinates and their transformation, geometric r i с : v Ба, 
эле е properties of quartics and their classification, reciprocation and duality, foci and € 
surfaces cones, and the general theory of surfaces. One or two examples and exercises aro inse 
intervals in the text. The index, contrary to most text-books, is both usable and informative. 


quations, quadriplanar co- 


F. N. DAVID 


Probleme der Statistischen Methodenlehre in den Sozialwissenschaften. (Third 
edition). Ву О. ANDERSON. 


Würzburg, Germany: Physica-Verlag. 1957. Pp. 358. 
24.50 DM. 


This is the third edition of a book first published in 1954 in the excellent series of monographs issued by 
the German Statistical Society. As far as can be judged, it has become a standard ‘intermediate’ text 
in the German literature. 

The scope of the book follows 
methods, on measures for de: 
probability, time series, 


traditional lines. There are chapters on the basic ideas of statistical 


As the name of the book implies, 
ensive account of methods. This is 
the standard text-book. ‘The level ol 
blems to anyone with ordinary schoo 
Е an this implies, in that the basic derivations and mathe- 
matical background for th х і і 


ion. Prof. Anderson | A ity of this 
ing hi н К rson has taken the opportuni y 
section of Kendall ED ed US Tám to line with that спинін for the Germ? 
other authors woul P ry of Statistical Terms. This is an admirable step WPIC 
in the use of tec ар it is not too much to hope that a growing uniformity 
owed by some standardization of Statistical notation. 


с. A. MOSER 
1958. Pp. 159 


" һе 
n Russian and English. Th 


identify the influence: 
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ө д = ttempt, 
its rescinding in 195 includes an a 


“notice? 
se of birth control pract the 
е urgent need to reduct dely 
ception—as yet not V anh 
ny Western demograph бсо from the English Sum 


avilY 
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© number of 
It appears that the 


ation projections, rest he 


Paper deals with the questions 
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The i ion i i i 
eas v gia is gained that in problems of methodology, the effort of overcoming the language 
қ Y will not be repaid, but the situation is quite different in relation to more sociological а с. 


volvi e 8 
Involving the Hungarian way of life. 
N. H. CARRIER 


American Housing and its Uses. Ву Г. Утумтск. New York: John Wiley and Sons 


Ine. London: Chapman and Hall Ltd. 1957. Pp. 143. 44s. 
ный sudy of density of occupation of rooms in the U.S.A, is one of the most interesting 
condon e. у in the housing field of recent years, and Mr Winnick's scholarly treatment inspires 
petsona e in his conclusions. The main data used is that of the U.S. Census statistics of the numbers of 
Ms в per room for households of different sizes and houses of different types and sizes. Mr Winnick 

PR mh density of occupation, space utilization as he calls it, by the ratio of persons to rooms, the 
v it while this is, he points out, an imperfect measure for many purposes, it is the best the data 
aeons > Information is also available about house rents and values, types of tenure and household 
hs - v in sufficient detail for effective examination of the relation between variations in them and in 
j^ PR ratios, Perhaps Mr Winnick's most important conclusion from this study is that size of 
^ O9usehold tends to be more important than either rent or income in determining variations in the PPR 
ratio, and he finds that in consequence the range of incomes is much greater than that of the PPR 
ratios, A multiple correlation analysis of data for a number of cities tends to confirm his conclusion. It 
" of particular interest that increases іп the number of rooms per person and decreases in the PPR 
ratios have been on the average small compared to changes in real income since the beginning of this 
Century, though, as Mr Winnick points out, this is entirely consistent with the other results. He 
Suggests that the great relative increase in the cost of building as well as the emergence of new durable 
Consumer goods such as motor cars, provides some explanation of the relatively slight decline in the 


number of persons per room since the beginning of the century. i ку. 

It is impossible to give any indication here of the many other important conclusions Mr Winnick 
Teaches. Their interest to housing experts in this country is enhanced by the fact that the general 
Pattern of space utilization in the U.S. in terms of PPR closely resembles that in this country. The 
excellent use in the inquiry of data on tenures, rents and incomes to throw light on the ini of 
demand for room space demonstrates the desirability of obtaining more information about them in this 
Country through the Census of Population. MARIAN BOWLEY 


Crops. An Inaugural Lecture by Sir 
Press. 1958. Рр. 30. 38. 6d. 


All Who attended Prof. Hutchinson's Inaugural Lecture felt it & ханада е prem i 
tion, by a master, of the lessons which the recent era of plant Vocum d oes initially available 
ax е art. АП impr ‘ovement programmes hinge on the rares in wild and сгор plants. 
ала Hutchinson gives a penetrating discussion of the Lum depends on population size and ecology. 
ӨтӨ ШЫРП. is lost їп the spread of an inbred crop 

inble frequency. Systems of improvement 

pes etic potentiality. The 


Ive early loss of gen 4 
ardeveloped countries, would appear to 


t breeding work, however, as Prof. 
pin isites for success. The selection 


Genetics and the Improvement of Tropical 
Јоверн Hurcuinson. Cambridge University 


ue forward, particularly in providing local 
don the interbreeding bulk or in recurre two con: 
TS chinson indicates with telling examples, ho This necessi 
со Sramme must keep to a clearly formulated aim.» continue to be а prog 
te Strol, The reader may expect that plant breeding W шз = Уш 
chniques, 
The University of 
na R. L. MOLER. 
Morphological Integration. By E. C. Prae Press act a8 agents. 1958. Рр. 317. 


а м НЕР 
Chicago Press, for whom Cambridge Univ! 
i. i d one а marine geologist. 
: tologist апа 0: ! 
ogists, one 8 aca chesification and correlation of 
tical am т their theory on actual observations. 
ui 


18 x ^ 
book is а collection of material by two biol 
ave tried 0 


f: 
b То, Olson and Miller have suggested theore 
Steal data and, contrary to general practice, 
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i i i view, but from 
j i biological point of view, ) 
i i judge of the worth of their theory from а 1 Ed 
me ener d vice de d is sensible and constructive. Broadly Е Ша ү I o 
Mi cil ны biological unit are correlated. (It was the exist 
P ements оп апу one Dio. gi т ^ 5 : iy 
that all раан led Harl Pearson to develop the theory of multiple and partial correlat dm 
ы 8 Olson and Miller discuss regression and partial correlation models, an А : : е 
ode d data are considered relative to their biological function. They thus get two types 
whert 


mw ns. un i i ciated 
the p-type in which correlations, etc. are found and the F-type in which are measures asso 

e p- r i 

bee ан to their biological function. 


Statisticians should read this book if onl. 


y to get some idea of the ‘old-fashioned’ view of biometry 
which is just now becoming fashionable agi 


ain. The wealth of data is also useful from many сініре 
view. It is also refreshing to read a book which is not a text-book of statistical theory Е we ees 
one in which the elementary statistical ideas play an important but secondary role in producing 
biological theory. 


F. М. DAVID 


түт. New 
Scientific Programming in Business and Industry. By Anprew Vazsonyi. Nev 
York: John Wiley and Sons Inc. 


London: Chapman and Hall Ltd. 1958. 
Pp. хіх+474. 1088. 


New books оп mathem ational research follow 
the same publishers, that it is natural to look fo 


treatment, or both. As to topies, the book undi 
Mathematical Program; 


ters 
tion and Inventory Control) 2 chap 

introduction and illustration, 4 on Linear, 1 on Convex 
Theory of Games, 


atical techniques of oper: 


in interest from 

parts of case studies, where they gain interest 

and lose in generality. ical 
tica 

-is different from that of other mathema! 


ffor 
Instead of proving theorems a great deal of e 
sense’ (p. vi). 


single “proof” in this book. 


perations Research Ву С. үу 
and E. LEONARD Arnorp. New York: du EST CHURCH 


L. Аско" 
and Hall Ltd. 1957. Рр. х+ 645. 965. 
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Sons Inc.; London: ChaP 
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е illustrative examples are t@ 
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the most accessible, and usually command the 
designed. A warning is given, how- 
research is peculiar to the industrial 


industria] problems since these are, on the whole, 
greatest Interest in the audiences for whom the book is primarily 
ever, that the reader should not therefore assume that operational 
Scene only, 

t three and the last deal with the general structure 


The book is divided into ten parts of which the firsi 
of operational research. The remaining parts, the filling of the sandwich, are concerned with a variety of 


Special techniques which have proved themselves to be of great use to operational research workers, 
The chapters on the structure of operational research are both the weakest and the most valuable 

Part of the book. Тһе ideas behind the design of these chapters are admirable, but unfortunately the 
authors һауе not been able to prevent some of the exposition being wordy and diffuse. Statements such 
88 ‘Tt ig Characteristic of operations research that in the solution of each problem new problems are 
uncovered ', and the phases of an operational research problem which are listed on page thirteen could 
? applied with equal validity to many disciplines and illustrate the difficulty of pinpointing in a few 
Words the essential nature of operational research itself. Chapter 2 ‘An Operations Research Study of 
a System аз а Whole', was disappointing. Although the problem outlined on p. 21 seems to require 
that the Output rate of the main product be reduced, the end of the first stage of the research left this 
Tate unaltered, nor is there any mention in the second stage of the research of such a reduction or, 
alternatively, of a step-up in sales promotion in order to keep pace with production. The dangers of 
Subopti mization are not discussed until much later in the book. The success of the market trend analysis 
scribed on pp. 49-52 is a resounding tribute to the robust character of A es quM and 9 
requently have been higher than the classical value of 5%. T За жы. dedo 

Tganization and оп the Formulation of the Problem', and the excellent case нїш ly р 2 
800d and succeed in emphasizing the comprehensive approach required in ОШ eet Dis 
SUN Chapter on the selection of a suitable team contains 3 fea eis ca which is so much 
Which members of the team should possess, but it is ae tion of costs there appears to be no 
Concerned with the maximization of profits and/or the minimi 


ch are encountered time and again in operational research 


In parts Ту-ут. fi zhi қ 
VIII, five types of problem w i oncerning queues and 
uident Тың pln fenton cnt alin, тойса ra 
reni т ЕЕ Un DUI кірге е problems, since a realistic formulation of any 


Strategy, Most statisticians will be acquainted еее Әң eA Батаа that the lack 
advantage, it proves a drawback in this 


Опе of them is almost sure to be in terms ано eds a 
of mathematical sophistication in the structural c ap force them into lengthy discussion and а spate 
°chnical section. The authors’ self-imposed сананат to omit many of the more interesting 
4 footnotes. Further, they are, for the same rens» IV in which inventory control is considered, is 
Cases of these problems. For example, most of p ee to, or of altering some of the parameters in, 
entirely deterministic; the effect of adding restri Bud rical example without any general rules of 
4 lincar programmin, madel is carried out only in & nume "tion on queuing models discusses simple 
Chaviour or нена тт bim being drawn; and although mi ans in series, of bulk service queues, and 
Ыр ОЧеВ at some length, it barely mentions tho problems of dés ors on testing, бш анд 
9f queues of pure лемі ith different priorities for UM bà prescribed reading for all Lo acm 
i : bin " X shou tion of a satis- 
Implere, қ Ы ‘hich comprise part IX she -tend beyond the construe! 
research m rm el wi eros Les ca e that their duties exten y 
АИС у be the wrong page 
actory қ ГТ worst appeared to і 
The boo oe MR 1 few misprints were noticed; br e dual theorem of linear programming. 
Dumb Ok is well set-up and fe 1 too short discussion О: list of references is quoted at the end of 
The ser = si cg om other hand, an excellent li: 

X itself is rather brief. On ti à КС rational research, so also 
each chapter, annot agree on à a es po dust qe edita 
th ot as most operational research workers E introductory бақы: » iven. Introduction to Operations 

f mo кішік to survey the field, and 


Xe are many opinions on the form whicl Sen ніш ds sh оша | 
al executive m for the sake of part IX. But the 


Place in the team for an economist. 


ti й 
н, research problem in itself to which paca vi 
arch will be of use to the intelligent, is search workers, if 0) М tls both to consumer an des 
ien tion which is 8 been omitted from the 
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a r 
Principles of Statistical Techniques. A First Course from the beginnings fo 


Е ; ; 8. 
Schools and Universities. By P. G. Моовк. Cambridge: University Press. 195 
Рр. 239. 22s. 64. 


" ісі е 
Тһе appearance of this book is timely for those struggling with statistics in schools, with Mure 
direction from above in methods and application. The author has resisted the таа ites 
deeply into theory, and succeeds in making the book readable, stimulating and beneficia he 
culations. His claim that ‘this book is an attempt to pu 3 
ods to students, who are fundamentally interested in the prac 


d 
› is well founded. The thirty exercises at the end of the secon 
ter for all tastes. Тһе mai 


itfalls 
ulation of data, so rarely given, is nicely handled, and the pitfa 


rated. A genuine attempt to link up probability, sampling and ma 

ther as a natural consequence of events or as the starting aera 
a theory and once again the examples are plentiful. The emphasis, perhaps by bold type, of impor 1 8 
ideas, principles, would have helped, and more summaries like that at the enc 
Chapter 10 woul 


"ved and 
able. The omission of X(O — EYE, where О, В are observed а 
expected frequencies in Chapter 11, is unfortunate, but economists should find the work on indices 
time series in + i 


Xen 10 
portance of correlation is giv p m 
; Ts’ and, secondly, by the exce 
5 aragraph of the book. 
This book deserves wi 
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> 
CORRIGENDA 


(i) Biometrika (1950), 37, p. 96 


On the statistical independence of quadratic forms in normal variates.’ Ву А. C. Aitken 


line 30, read AVBV (or BVAV) for AVB (or BVA). 
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AN INVESTIGATION OF HARTLEY'S METHOD FOR FITTING 
AN EXPONENTIAL CURVE 


By Н. D. PATTERSON Амр S. LIPTON 
Rothamsted Experimental Station 


[INTRODUCTION 


H 
artley (1948) showed that the parameters of an exponential regression curve 


40-а-Й (0<p< 1) () 
considering the regression of y, on z and on 
1951) has described an iterative method 
d p which is very suitable for routine 
ter is available and which, moreover, can be used 
ting facilities are limited, however, Stevens's 
ods, including Hartley’s, are 


ъ— 1, can be estimated by 
ce then, Stevens ( 
es of a, f an 


Where x - 0, 1,2, А, 
eg partial sums of the y,. Sin 
applic CN the least-squares estimat 
Wim ыы when a high-speed compu 
Ecl sare not equally spaced. When comp" 
Still i is somewhat laborious. For this reason other meth 
п common use. 

t Hartley (1948) showed that his estimates of p are of high efficiency for very large л in 

16 special case g = 0. It might reasonably be supposed that the estimates are also highly 
“ficient for finito y and fora unknown although this does not necessarily follow. The purpose 
9 the present paper is to discuss the efficiency and bias of these estimates and of other 


esti Basie : ў tterson (1958). 
mates of similar type using general for mulae derived by Ра ( _ » сте 


es оға and В as these can alwa; 


e will n А ар? 

А ot be concerned with estima Е 

© linear regressions of y, оп 7 Where” is an estimate of p- The resulting fit depends on the 

У ==. тез estimate. Tf r is subject to large errors or 
r are small 


st-squa i 
he other hand, if t 
d fit can al 


he errors and biases in 


«aon: 
s of r — $, where # is the lea 
S 
and tj 16 fit may be very poor. On t - eis cites 
he true relationship is in the form of (1) 28 


pRATIC ESTIMATES ОЕ 


LINEAR AND QUA 


Prov; - 
Vided that "ў w, = 0 the expression 


: n-l 
> WY Ay (2) 
d o ee 
r= 21 Аз ^ 
У WyYx-1 
1 Lostimateso this буре have been proposed (though 
ral es 


Proy; 

1 4 

des consistent estimates of p. Sev? | 4 
ated functions of ? itself. 


1 : . . 
lecessarily in this form). These include: ich the We 21° complic à 
9 Stevens’s least-squares estimate £n W pics r functions of the yz. These estimates 

"n Estimates in which A and А» are oti m es with the w, not dep ending ge 

1 : i 
А ear 68 
Һау © called ‘linear estimates’. Suitable € cases ® = ^ ‚бапа T. dA, ai drati 
een discussed by Patterson (1956) Е: tions of the y. 59 that 4150443 = циа 
fy #7 Stimates in which the Wz 79 linear WU не à ‘quadratic estimates’. As will be 
lo s ш be ae „atje estimate. 

8 ns ates W : dratic es 

ho of the у„. These pra estimate of pis W” Biom. 46 


ng 
18 in the next section Hartle 
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i ible, by a suitable choice of the w,, to obtain both linear and quadratic ae 

= » аа for some particular value of p, say py. The quadratic es upon 

чой TO a cover a wider range of p (around p,) with high efficiency than 4 be 
sein чы linear estimates but, as pointed out by Patterson (1958), they may be вир) 

ge biases in some circumstances. | к „Ж 

to E. Hartley's estimates of p are considered in detail some properties of linear "eet 

particularly those having minimum variance, will be presented. The variances an ari 

tions of Hartley’s estimates will then be determined for comparison with the 


А adently 
variance linear estimates. In the following it will be assumed that the y, are indeper 
and normally distributed with variance o?. 


The expectation and variance of a linear estimate of r are approximately 


n—1 x n—2 
|р ЖАУ 


2 UST о? (3) 
5 1 
40)-р- [n=l 2 p 
| (s) 
1 

í тет . т-2 

[ELS шамы) ya m 
and var (r) = 1- E 1 


NE NA 


These expressions are suitable if о? 
Patterson ( 


p = py if 


is small relative to 42. num when 
1958) has shown that for р < 1 the expression for var r is at a minim 


(8) 

(1+ pj) ш, Роша, 1) = ky (pg + ks), 
59-1; = 
are not all zero and k is such 


where x = 1,2, 


e Ws 
ИР 4 hat th 
W, = 0; ky is given any convenient value such tl А 


2 ЖІБЕ 
that Уло, is zero, A solution of equation (5) (with kı 


(6) 
W, = Z dyp (=i), 
1) 
с, A 
where dj = Е. d 4. | 
4) 2 су , 
d а-в Ф ү 
"er €50,29 V -po)l—-pya-h) —— 
ij t зү о —— 
"с пә <. 
Equation (5) shows that, in this case (i.e. №, = 1) 
т-1 2 | 
1 4p 2_ п т-1 
( p ) У Wy 2р X Ww. = x w, põ 
when p = ру Consequently, the variance of the minimum variance estimate 18 
: (0) 
var (r) = и ——— | о? . 
Wpf gas 
where the W, ате as defined in (6). rp o): 
Linear estimates havin i | 
g full efficiene = 
It is also Possible to obtain es ы 
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given by (6) tend to be 
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Th i i 
vasa nee will be denoted by r(1). As po approaches 1 the w, 
portional to У jd;; where x = i. When py = 1 Ы 


2 
Cy = Cj = in—-j)n (i<j) 
" | < )). (11) 
onsequently r(1) is given by equation (2) with 
w, = e(n- x) (2e — n). (12) 
$, theleast- 


А: m Я 
S pointed out by Patterson (1956) the linear estimates r(pg) are equal to 
mulae given above for the expecta- 


Squares i 3 
s estimates of p, when py = f. Тһе approximate for 
ist-squares estimate $. 


tion 
ar ari i i 
Q ra variance of r(p) are also appropriate for use with the lea 
кайта i 5 : ai В 
ratic estimates of p with minimum variance when p = ро are given by the ratio (3) 


with 
T Wr = хауа + Їй), (13) 
in a v= 6 ў=1,?,.. and 1 is non-zero and the d;; are as defined 
к. (8) for ро < Lor (7) and (11) for p, = 1. Only the value of p, and the ratio К/Т are 
з a 1 to specify these estimates. Thus, the I: and lof (13) can be multiplied by any non- 
ratic E number without affecting the estimates of р: А convenient notation for quad- 
"ed stimates with minimum variance when р = Po is, therefore, (ро Е). This differs 
ghtly from the notation previously used by Patterson (1958). 


The expectation and variance of r(po: ЫП) are approximately 
2 2 2 ә)» 
| я Е 4 2kp—1l) № =) (14) 


42 —1, at least one of k 


9 м U 
оғада (1— pk) trace D + (k—pl) trace DE . 
е Pl — 7 (e lp) Lo (k+lp) Fo 
ча var (r) = 0? (ea n (15) 
pi И 
wl d nz} а at р 
lere де z Wp Re 5 Wi № = » WW. (16) 
0 
Kass (27) 
j . 
Dis the matrix of d;, defined by equations (7) and (8) or (7) and (11) and Uis an”— Іжт-1 
Matrix ij жүт pd 
00- 0 0 
© : 10 0 0 (18) 
T lee Thus the vari 
he А ее Patterson (1958). us the variance 
derivation of these expressions has been gi ie "e (15) are identical. 
de Hartley's estimate. The 


(bo, kll) is; = 
o kfl) is independent of kil. huge рд (1,00) ене 
be considered in 


8 will ily of 
be shown later the family 9 ill therefore 


“Xpec 
Hae re 
ation and variance of 1,00 y (0 and (11) and 

2. 15n4 22). (19) 


rather more detail. 


тЫ ; 
Nese estimates the d;; ате given ы 
F , traceD — 002—9 киер” pt ЕП) when ? is very large it is 
or th P «as and V. малое 4 
9o © purpose of stud. the bias АУ” 

“venient z РМ he alternative orm 55 
to express the W, in the " (20) 

peat +P 2 8- 

18-2 


т, = athe 


284 Hartley's method for fitting ат exponential curve 


where d = — 1/(1 — p)? and a, b and c are such that И = W, = XW, = 0. For large n, constant 
9 = p"! and x = nz , m T (21) 
We 10): 00020 0%), 
j е 3(1—0) (22) 
where b =-2[2404 na |, 
23 
lad! = 3149, (23) 
UT? өз Digi (1—0) (3 — 0) (24) 
Е һ-( ++) ИТЛЕ 
__@- 9) 4(1-0)+ с" (25) 
and бан 210 — (In 0p C 
26 
fn? var (r)a? > (In 0)? (L, — Lj) ТА. (26) 


А 4 s 264 shat 
It can also be shown, using the technique described by Stevens (1951, pp. 264-5), i 
n? var r(p) tends to limiting values for constant 0 as п is increased. Hence the efficien 
of r(1, k/l) for very large n depend only on 0. 


The bias in r(1, k/l), when k+l + 0, tends to 


g? (In 0) (150.— k) — 20-1) In 0} A 1) (27) 
=| Зо t 


as n increases with 0 constant. 


HanTLEY's ESTIMATE ог p 


Hartley (1948) noted that, ify = а Вр, then 


(28) 
V, = 6+6, + зҮ, 
(29) 
where 5 = Y, Lx 37.1 EE iy 
25, = -y y, 4... Уи) for even "| (30) 
= 00 2001 ya suo Уз) — ры) for odd n 
апа 


р = (2+8,)/(2—Ь,). 


squares. He called this procedure ‘j 
The observational equatior 


Y = Lb xb, + Yo, (31) 
where y is a column vector with elements 
х 18 à column vector with element; 
are then 


H А à 
Ya li is a column vector consisting of n 1's же 
SPO) 1D, жд. The least-squares normal equatio 
y= nb, + E ЗУБ, 
ху = Хх, +, Үр, ы 
Ху- Y'1,0,4 Y XB, ЗБ, 


ГТ See also, Hartley's note printed on P- 293 below. Ed.] 
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For the purpose of obtaining an explicit expression for b, (and hence for the estimate of p) 
it is convenient to introduce the matrices 


—1 1 0 0 

0 —1 y Q 

Т (of order n — 1 xn) = 0 ó e$ Т 
Yer 


Valy- 
= У-!— > 
Di = У-га, 


J, an nxn matrix with diagonal terms equal to (n — 1)/n and non-diagonal terms equal to 
— т, and 1, a column vector with n — 1 elements each equal to 1. It can be verified that 


TYT zm 
and that the elements d;; of D are as defined by equations (7) and (11). 
Elimination of 6, from (32) yields the two equations: 
X'Jy = x'Jxb, + X'JYb,. \ 
X'Jy = Y'Jxb, + Y'JYb.] 
Since er’ = 1’, Ty = y, — yo and 2TY = y, + уу, where yy isa column vector (Joshi Vas) 
and y, is а column vector (/1.)».--У,-г), the regression b, can be written 


(33) 


bs 2(yo+ y) Di(yi — yo) 
(Yor У!) Di(yo- y) ` 
Hence the estimate of p is oa 


(34) 


: +b, - (уо + ОАР 
-0, (Уғу1) Diyo 
sen is equivalent to equation (2) with the w, given by (13), ро = 1, and kil - 1. Con: 
whi Ж y, Hartley's estimate of р is equal to r(1, 1), one of the family of quadratic estimates 
en. tend towards full efficiency as p tends to 1. 
"e will be noted that the estimation of p does not depend on the definition of Y. Nor does 
а estimate of a, given by —6,/6.. Alternative estimates of В can, however, be obtained 
?pending on Ү,. With Y, given by equation (30), 6p? is estimated by 


(= 02/05) — b, — (n — 1) 0/2. 


bol tye 


If y. ; . 
EY is taken to be zero the estimate of 2 is — [0/03] — bi. 


ALTERNATIVE INTERNAL REGRESSION METHODS 

Тһе reasons behind Hartley's choice of equations (28), (29) and (30) were not made clear in 

€ origina] paper (Hartley, 1948), and other quadratic estimates of p obtained by ‘internal 

Sression › methods have sometimes been used as alternatives to r(1, 1). Finney (1958) 
"nSidereq the simpler equations 


y, = (1—p) +руг-1 (35) 
ang les Yep 
Уна 2 Е P (y aa +0.) (36) 


MENT l+p 
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iled investigation 
hich lead to r(0, оо) and 7(0, 1), respectively, as estimates of p. In a detailed investig a 
af the case n = 4 he found that Hartley's estimate of p showed no special is nie or 
r(0, co) and 7(0, 1) and suggested that this result might also be true for larger values d Б 
Other estimates, іп the same family r(1. /:/]) as Hartley's estimates, can be obtainec 
(28) by replacing equation (29) by 


31) 
У, +, k+l +0. 


if 
The estimate of p in this case is (1+ Kb3)/(1 — 16.), the estimate of a is — bs] (Kk +1) Y ee 
Y, is taken to be zero the estimate of P is — [bs] (I +1) b3] —b,. An estimate ғ(1,1/ dum 
k+l = 0 exists but cannot be obtained from equation (28). In the introduction to his P S s 
Hartley (1948) appeared to be considering r(1,co) rather than r(1, 1) (see studie [8 ^e 
Hartley's paper). Monroe (1949), reported by Anderson (1956), developed greed 
the case 7(1, 0). Anderson (1956) noted that different estimates (sometimes very Ros nii 
are obtained by Hartley's and Monroe's procedures. White (1956) has also conside 
alternative estimates in the family 7(1, k/l). 


A more general relationship than (28) can also be used (Patterson, 1958). This is 
Yo = bı + bop§ +s, 

Ү.-руҮ, = Ey, ly, 

The estimate of p in this case is T (py, kfl). 


type have the advantage over r(1, k/l) tha 
the range of values of p occurring in pra, 


where 


f this 
It is given by (py + kbs)/(1 — 163). Estimates of 


ey jithin 
t they can be arranged to have full efficiency W 
ctice. 


(38) 
standard error of r = бов, к 
36 
bias in r = E(r)-p= Bo?|p2 ( М 
4 
апа T = BIS. ( 


The tables are given in terms of 0 


-nin the 
= p"-! rather than p because large values of p within | 
range 0 < р < 1 tend to become m 


ore important as n increases. If n is changed by changin? 
the interval but not the range of the independent variate then р”-1 remains constant. ian 
n? varr and n?biasr tend to non-zero limiting values ав ^. is increased with 0 cons 
(0 < 0 < 1) both for = 2 and for r = r(1, Е). E 
Values of 8 for ? are set out in Table 1 for various 0 апа n = 4 to 7, 9, 12 and 20. ы 
sponding values for the estimates (1, k/l) are given in Table 2; these values are indepe™ The 
of ШІ. As т increases the Sn? tend to limiting values which are set out in Table 9- 1 
efficiencies of r(1, 2/7), given by the ratios of 5° for ? and r(1, k/l), are set out in Table 3 8 
for very large n, in Table 4. 


Jue 
The estimates r(1, 2/1) tend towards full efficiency as 0 approaches 1 but thisis of no V? 


ч this 
in itself as the standard errors of even the efficient estimates tend to be very large ™ 
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region. They ari evi i i 

E iot bue how = highly efficient over a wide range of smaller values of 0 

act the efficiencies only tend to become low for 1 | 

им y ten or large » and very small values of 0. 
generally high efficiencies of r(1, /2//) compare very favourably with the efficiencies 


of other non-efficient estimates of p. Thus: 


(а) The estimates 7(0, 4/1) are very efficient when п = 4, but their efficiencies fall off 


rapidly when x is increased (Patterson, 1958). 
E us efficiencies of linear estimates, (ро). of the type considered by Patterson (1956) 
8 relatively quickly ав the difference between р and p, increases. Table 4 shows that 
for very large n, r(1, kfl) covers a rather wider range of 0 with high efficiency than two linear 
estimates together: r(1) and (ро) with pit =0-0071. The simplicity of calculation of linear 
estimates is still useful in practice, however, for example as initial estimates in the iterative 
least-squares procedure of Stevens (1951). | 
(c) Estimates (ду, К/) with suitably chosen fo! 


of 0 than (1, 6/1). Thus if ра is taken to be 0:00 


large n) for 0 > 0000075 whilst the correspondin 


small values of 0 appear, however. to be rare in practice. 
The values of 7 set out in Tables 5-8 give, when multiplied by ор, the ratios of the biases 


to the standard errors of ? and of r(1, ЕЙ), with kfl = 0. 1 and оо. Unlike the variances the 
biases in r(1, 2/1) depend on ЕП, Limiting values of Z'n? are given in Table 9. Negative values 


of T indicate that p is underestimated. 
Тһе biases in Hartley's estimate 7(1, 1) (Table 7) are often considerably greater than 
those in the least-squares estimate 7 (Table 5). For many purposes, however, they can 
ge standard errors 


probably be ignored as they are no the estimates of p have lar; 
and are consequently already un For example, if n = 12, 9 = 0:001 (i.e. 
P = 0-53) and c = 0-10 the standard erro 08 and the bias is about 0-01. 


TIB would have to be about 0: 49 for the b of the standard error, in which 


Case the standard error would be as high as 0:32. 


Тһе biases i i d r(1, 00) are gener 
ses 1 timates 7 , А s 
n= 4 there E in using € se estimates 1n preference to (1,1). Ifn = 4 


is i ittle value as 
the biases i Дег than 1,1) but even this is of liti 
s in r(1, 00) are sma er th à я 
much simpler н at өй өң; 0:8 15), with as ency and very low bias are available. 


Кн "(0,2 75 ral independent estimates 
Biasin Hartley'sestima 


f te may bee tables, provided that an 
of p are required. The bias ca” 


à of ae this pu be determined from var- 
ccurate estimate of 0 is aV? == ШЕ only with small biases. If 
metho 


lation between replicates. Chismes. dag he T and 8 may no longer be 
the biases are large the approximations invo 
is likely to be poor. 


Adequate and the estimate (1,1) by ^ 
оа for dealing with pias is to pod a in r(1, И) ате zero. Great accuracy in 
ab 10. These values №1 are mi US alte of ЕП, 1-5 88У; can be used for most purposes. 
is not required. Thus forn 7 T can also be obtained using the tabulated 
emis а scaling factor, 


Тһе standard errors and biases 0 i Ir wher 
andar ноп — (пр) 
Values of 8 and T. Thu (еее ы y is an estimate of pand A = — (In nm. 


s, for example: 
18 often of greater interest than p itse^- 1 
dÀYY 45 YAT e) 
var (А) = fe (ШІ yar (г) = тірі 


ying between 0 and 1 cover a wider range 
71 the estimate is 95 % efficient (for very 
g limit for 7(1, k[l) is about 0-0015. Very 


t large unless 


satisfactory. 
r of r(1,1) is £O 
jas to be one-half 


ally much larger and except when 


requires to 


(1, 2/1) with values 0 


(41) 


then 
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S.E. (r) So (42) 


so that —— mp — mpff 


The bias in Â is 


bias (Â) = 4(4)-А- d (0), bias (r) 4 | (=) var (y) (43) 
- bias (r) п Edd | (44) 
тр пр" 
i.e. Баз (Л) _ _ bias (r) х.к. (r) (45) 
S.E. (À) S.E. (r) 2p 


In these expressions all standard errors are taken to be positive. 

The efficiencies of аге, of course, identical with the efficiencies of r. The loss of efficiency 
of Hartley's estimate for low 0 may be of some importance when the estimation of am of 
primary interest. The total amount of information on А is proportional to p2(Inp)?/var? = н 
which has its maximum when 0 is small. Thus when n = 20 the values of / are as follows: 


0 0-000001 0-000125 0:001 0-008 0-064 
I 0-228 0-275 0-289 0:267 0-146 


Hence useful estimates of A are available when 0 is as low as 10-6 and it would therefore 
be worth while in this case to use a better estimate than the 72 % efficient Hartley estimate. 

As in the case of the estimates of p, the expectation of the Hartley estimate of A is closer 
to the expectation of the least-squares estimate than are the estimates with 0/1 = 0 on p 
This сап be readily seen from equation (45); the second term on the right-hand side 1$ 
approximately the same for all four estimates. Р 

Thus similar recommendations can be made for the estimation of both p and A. If high- 
speed computers are available Stevens’s method should be used as it ensures a good HM 
all cases in which a good fit is possible. If computing facilities are limited and the ordinates 
are equally spaced, Hartley's method can be used and in most cases will be superior, in the 
sense of having high efficiency and relatively low bias, to many other inefficient estimates 
which have been suggested. A worthwhile modification that can be made is to replace 
k/l = 1 in the Hartley procedure by k/l = 1-5 or some other convenient value determine 
from Table 10. This will often result in a reduction of bias. If the curvature is very marke 


so that most of the y’s lie near the asymptote it may be desirable to use the related estimate 
"(ро КТ) with py not equal to 1. 


Summary 
The efficiencies and biases of the estimates of p in the equation y = æ — 3р“, obtained by the 
method of Hartley (1948), have been examined numerically using general formulae derive | 
by Patterson (1958). It is shown that while Hartley's method generally leads to estimate 
having relatively high efficiencies and small biases, and can therefore be regarded 25 
adequate for many purposes, the same conclusion does not hold for similar methods wbich 
are sometimes used in place of the original Hartley method. 
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Note added in proof 

Another method of estimating р which deserves to be mentioned is to replace the We 
of equation (2) by suitable functions и, "Ve and solve the resulting quadratic equation 
mr. In the case m = 4, using 4, = — 10, 5, 5 and v, = — 17, —5, 22, efficiencies of at least 
99:99/ are obtained over the entire range of р and the estimates are close approximations 
бо ?. For n = 5 suitable values of и, and v, ате th; = 77 16, 5, 7,4and — 14, ра 22, 10, 26. 
There are grounds for believing that in general high efficiency can be obtained ч "n 
ав easily by this method as by the use of quadratic estimates. DE 
the standard error of ? 


——— — 


Table 1. Multiplier ‚ S, of сі) in terms of 


0 1 | n=5 n=6 | n=1 | n=9 n=12 | n = 20 
„=“ =: 
| | [ | 1:028 
.15: | 
4 | 


| | 
5 -06 1-049 2 
118 1-095 1-069 9 02 
| 09 ed > na | 1100 0-996 0-847 0-561 
оа | To aic um | ше | UO 0-470 
e 2 "o 56 87 0-668 0-382 
1 | r312 1-166 1-056 | 0-871 ) $i 
| od | 1421 | 551 1114 0-860 | 0621 | 0-328 
0-064 | 1-777 | per 1-398 1034 | 0718 | pc 
| re 2-560 a 2431 1.144 1178 | 0575 | 
| 0-512 5:024 9° мк с Кен = 
| i dard error of (1. ЕП) 
- trong. S, Of орт terms of the stan à үз m 
Table 2. Multipliers, >> P ересен 
іші | —Ps 90 | 
"as И | 1 n=9 n= 12 п = 20 | 
= п = 
0 n=4 п = 5 n=6 | ний 
Г c ü | 
1 4 1.272 1-349 past pun 
24] rh 059 0-906 6 
Е- | ow | | 158 | 1059 | 4-9 | 0:487 
| 0-000125 | ed E 010 0.970 | pn eas 
|, 0:001 5 | F e | 1.182 1066 | 0:861 | 0-676 ү 
‚000 1-32 | 1-176 ee 0:860 o1 m 
0-004 | 19426 1-288 on 1-034 0-т18 0362 
белі | peii | 1-668 | 1:744 1178 0-51: 
e | 2:56 9.981 2 - 
9-512 5.024 m 
Bac 22 


2 p > н, 7 ТЫ 
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Table 3. Percentage efficiencies of r(1. kjl) 
заган prt жақан peo | pe 
| 0 n=4 hom nm n2 п= 9 | телі gp we 
| | m 
| | _—— | 
| | ж Eh 
| 60 96 89 SI 74 63 51 9% | 
0-000001 | 97 90 86 82 s0 T4 ai 
| 0000125 | 98 94 92 90 SN 8s 5 | 
0-001 98 96 95 94 94 93 i 
| 0-008 99 99 98 95 98 98 9: | 
Жөні á bé 
For 0 > 0:064 the efficiency is almost 100%. 
Table 4. Percentage efficiencies of linear and quadratic estimates for very large n 
Hu á ааа ал _ E ФЕНЫНРЕТ 
| Linear estimates Quadratic estimates 
0 —— * = " = = 
| r(1) ТЫ nk) | (po ШЫ 
0-000001 жа | 52 m | 86 
0-000125 ms 83 88 96 
0-001 | = 95 94 100 
0-008 89 100 98 100 
0-064 94 94 99 99 
0:216 98 87 100 99 
0-512 | 99 = 100 99 
1:000 | 100 | = 100 | = 
banana ш |. ы | m 
* pot = 0-0071. 
m Я e. 
Table 5. Multipliers T of [В in the ratio (bias of ?)| (standard error ој?) m 
f | — - — = e 
i = 20 
| = 5 | “= 
| 0 | n-4 n=5 n=6 | mai | T] | п = 12 | p 
| ——— = рае = | | | aa 
—1 SE 9 
ШТ | | | | 0-00 
0-0 0-204 0-096 0-056 — А 010 82 
0000125 | 0223 | 0059 в ш 0-019 eed об. 
. 5 9 — 0:127 — 0:247 —0:3 _ 0:46 
| 0:001 0.253 0-06» — 0-061 — 0-150 0:978 — 0:376 0-387 
| 0-008 0-344 | 0-119 — 0:019 —0-115 228 | —0:323 70-225 
0064 | 0-633 | 0354 0-187 быз. | come | шуш | “іші 
0-216 1334 | 0.743 0:5 —0:055 2 | 7000 
1 511 0-359 0-175 0037 | Q0 
0-512 2-451 1-706 1-266 | 360 | ‚ 
29 0-978 0-628 | 0: | p 
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т(1. 0)}/{standard error of (1. 0)} 


Table 6. Multipliers T of af in the ratio {bias of 


ре Es 4 
| 0 | й=4 | n=5 | n=6 n=7 n=9 n= 12 n= 20 
= а 
0-0 (T = co for айт) | 
0-000125 18-983 15:141 13-039 LI- 10-067 8:676 6-800 
| 0-001 9-479 9051 | SAT | i 7188 6:360 5.144 
0-008 5-5 5-650 5608 5 | 5052 | 4742 3-942 
0-064 | 4122 | 4208 4 | 22156 | 890 3:316 
0-216 4.315 1493 + 4436 | 4187 3-579 
| 0-512 | 6-626 6:846 6- 6-741 | 6370 | 5456 


дц 
J — | 


Table 7. Multipliers T of сір in the ni 


1 | n=9 n=12 n = 20 


0 n=4 n=5 


= — | 


oy 44 pa | 
0-534 1:02 
0-000125 0-520 0:162 1 то 0:852 | 
0-001 0-522 0.655 0:812 | | 9-681 0-708 0-659 
0-008 0:563 0:574 0 613 | | 
0-064 0-719 0-580 | 
0-216 | 1:233 | 
0:512 | 2-520 


—— 


| 
|В 
| 


0 в=4 | n=5 енен x: 
— н Қы. i s. е 9.180 9-072 
a) aaah | 
9:0 _озвв | -0612 oio 1 =, dta 
0-000125 — 0-368 б Ñ —1:150 | a 
0-001 
0-008 
0-064 
0-216 | 
0-512 | 


Pe 
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Table 9. Values of Sn? and Та? when n is very large 
Sn? Tni 
0 y pce focum А 
$ r(1, kil) 8 | nkh0) | ond) r(1, o0) 
| M 2 | | Б 
Икан | | | 
0-000001 1593 | 28-76 
0-000125 | 90-2 11-67 — 830 
0-001 66-0 7-06 -10:79 
0-008 47-0 3.75 —12-13 
0-064 35-6 1.77 = 
0-216 36-8 1-06 — 16:38 
0:512 55-8 0-72 — 26-91 
Table 10. Values of k/l for zero bias in r(1, КЇЇ) 
— = = ee MINOR CMT 2 — c LE 
0 | n=4 4 c5 п-б n=7 | n=9 m= 12 n= 20 wee 
ге | | )0 
0-0 | 2:50 2-67 914 |-10901  -344 |-180 Г 
0-000125 2-49 211 247 | 280 еп | sar | № 
0-001 | 2-53 1-92 1-97 2-06 2.15 2.293 | 221 
0-008 2-81 1:79 1-63 1-62 1.62 1-62 qua 
0-064 4-18 1-58 1-48 1-40 1-35 T MES 
0-216 7.71 2-16 1:51 1-36 1-28 1. жн 
0-512 19-6 2-54 1-58 1:37 1-25 1+ += 
1-0 © | 3:00 | L67 | 140 1-25 мя | № 
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THE EFFICIENCY OF INTERNAL REGRESSION FOR THE 
FITTING OF THE EXPONENTIAL REGRESSION 
Bx H. O. HARTLEY 
Statistical Laboratory, Iowa State College 


Ten 
years ago (Hartley, 1948), I suggested a method of estimating the parameters а, 7 


and p of the exponential equation j + fo? 
= OTe (1) 


. The notation of (1) is that used by Finney (1958) 
from the direct least-square treatment of 


(2) 


„n. Finney 


b s 
x what I called ‘internal regression’ 
ho 3 ; 5 "E 
its compares my estimators with those arising 
e auto-regression equation 
Yiz = 91-р) + PVE 


оға y; are the observed values at n regularly spaced points s = i = 1,2... 
and ера use of this equation to Dr St C. S. Taylor, deals fully only with the case n = 4, 
hone т ers both the model (а) where the errors in (1) are assumed independent and of 
iius n "SEEN, and (b) where this assumption is made for the errors in (2). My paper, 
ute : examples of the response of crop-yields to independent applications of fertilizers, 
squ mainly concerned with (a). In comparing the estimators arising from the direct least- 

are treatment of (2) with my method (which is essentially a least-squares method applied 
Finney states (p. 376): т ble to understand 


have been una 
be claimed for this procedure on the evidence that Hartley 
leulation, it is considerably more 


of +} i 1 laborious than either 
fas Eu considered above, because of the various partial sums that must be formed. So 
it is its precision is concerned, the construction by way of regression 18 irrelevant because 
es no account of the pattern of the errors.’ 

E do not think that the full facts of the case justify thi 
Scuss the pattern of errors (called by me residuals, see $ 3-1), 
Properties of these residuals which are indicative of therelative efficiency o 
of estimation. It was the study of these residuals which prevented me from | 
Square estimator arising f ms (2) under assumption (a). That 18, I deliberately decided, on 
Бі Strength of the error residual relationships summarized in my paper, against using the 
Very estimator which appears to beadvocated in theabove passage by Finney. This estimator 
Aas been critically examined b; Patterson ( s considered as severely restricted in 
y examiner Y his study of Patterson’s 


1958) and i 
Benera] use by Finney himself (1958, P- 387, line 4 from 


bottom) after b 
result te the motivation for my estimators—I am 
à S. Мог 5 :nes 6-16), Í clearly state á Sade 
Mstifying =e iu parse of their large sample variances and оков 
Obta; i form 

ained by comparison with t anos 


ulae of the maximum likelihood 
esti "form: 
Mates. T agree, of course, 


rge sample Pe ance of an estimator does 

Not nog | à but I did no more than follow 
к essarily guarantee its өш 
mmon statistical practice ul 


to t; 

"^ he progressive sums of (9)) 

ра at particular advantage сап 
Pplies. Viewed as a regression са 


is assessment, for not only do I 
but I also point out certain 
f various methods 
using the least- 


i О 
pmo inn e cedures on the grounds of 
1 &dvocev о 
mulae are given for case (а) 


о; 
ent large sampl rformance. 
e large samp 
Ог the special case & = 0 nd n largo exhibited in Fig. 3, where it will be seen that the 
nd graphical < 


n 
Pages 43-4 of my paper à 
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efficiency of the method tends to 100% as p — 1 (my q = —nInp-> 0) and remains high — 
the practical range of p. At the time of publishing my paper similar large sample results for 
the estimates of all three parameters in equation (1) were in the course of being worked gut 
by а student of mine (Miss I. Trivedi) and her results were subsequently recorded in PE 
Master's Thesis (University College London, 1949). These results for large n again show 
the high efficiency of the method for the practical range of p. diti 

In the present issue there appears a paper by Patterson & Lipton (1959) who kinc y 
allowed me to have a preview of their typescript. These authors give a most comprehensive 
evaluation of my method of estimation for both large and small n using the elegant formulae 
previously derived by Patterson (1958). т 

It is of interest to compare the limiting efficiencies (for n — со) obtained by them WI s 
those computed from Trivedi's approximate formulae. Тһе results are shown in Table 
and appear to be in good agreement. 


З Ae А + 3 peti тат 
Table 1. Comparison of the percentage efficiency of an * internal regression estimal 


Computed from | Computed from Patterson’s 
Trivedi’s formula | and Lipton’s formula 
4--іһ0 | gon Mem 
0 % efficiency 0 % efficiency 
u Em 
| 0-064 99 
| 3 0-0498 99-7 - — 
4 0-01832 99-2 === a 
5 0-00674 98-0 0-008 98 
6 0-002479 96-2 — = 
7 0-000912 | 93-8 0-001 94 
8 0-000335 90-9 — = 
9 0-000123 878 0:000125 88 
10 0-000045 84-6 — on 
0-000001 72 
eae = ne 


jas 
" А па bic 
However, for small values of т, Patterson's апа Lipton's results on efficiency 9! 


could certainly not have been inferred from our earlier large sample results. Indee » oti? 
extremely good performance of my estimator of p for small values of n, which was de 50 
strated by their calculations, was a pleasant surprise to me. White (1956) (а. ны no 
O. Kempthorne) had independently obtained some of Patterson's formulae, but ha 
made a comprehensive numerical evaluation of efficiencies (or bias). uae? 
Patterson and Lipton recommend the use of Stevens's (1951) 100% efficient aie тез 
estimator where a high speed computer is available and I agree that where computing 07 of? 
are faced with the fitting of this exponential curve sufficiently frequently; the writing ; 
High Speed Computer Program specifically for this computation and for mode 
warranted. "PT 
Powerful methods of analysis, such as those employed in the above paper per now 
detailed assessment of the relative merits of estimators. The question now arises, 


Ў ре 
sensitive are the more refined judgements to likely departures of the error pattern from 
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assumed Tarer 

a с icum For example. when the first ordinate of the exponential curve (measured 

intone : с е is 20 times as large as the last ordinate or even larger (Patterson's & 
< 5), itis doubtful whether the assumption of constant variance will be aa 


satisfi Vom dh ; ; 

2. Біпсе it 15 only in this range of Ó that my estimator fails to be fully efficient, a 
ss conne of this drop of efficiency under variance heterogeneity may be desirable; 
y, this raises the important question of the ‘robustness’ of the two estimators H sm 


sure we ar ў i i 
ve are all agreed that as much information on the physical mechanism (generating the 


respo i i 
ропзез y) as one can possibly obtain, should be used as a guide to set up an error model 


Thus ini ili i 
us in independent fertilizer trials one would usually be inclined towards (a), in time series 


Грае such as growth curves towards (b) or some other stochastic alternative. How- 
ia ^d e ue model (such as the model (а)) is accepted not because of definite evidence as 
y but rather because of lack of evidence for a better alternative. In such situa- 
ared for (at least slight) departures from the accepted model and the 
een h 5’ of our optimum estimators becomes of great importance. 
son (1958) seems to have made a start with such studies and reports (p. 399) that he 

he model (b). Finney (1958) also dis- 


h > 3 
as found Stevens's estimator fairly effective under t. 
learly other alternatives to the two 


Cuss ix 
es mixtures of the two models. However, there are с. 
this issue. 


mo i 
dels and much remains to be done on 


tions we must be prep 
question of the ‘robustnes 
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THE DISTRIBUTION OF MOMENT ESTIMATORS 


By Г. В. SHENTON 
College of Science and Technology, M anchester 


1. In a previous paper (Shenton, 1958) we introduced a class of moment estimator 
depending on the sample moments. Wenow consider the sampling distribution of a moment 
estimator and give expressions for the first four cumulants, these suggesting that the 
distribution is asymptotically normal. Ав а special case, when the moment estimators 
depend on an infinity of moments, the cumulants are those of the maximum likelihood 
estimator (assuming one exists) and our expressions agree with those given by Haldane 
& Smith (1956). 

Our main purpose here is to treat the problem for the case of a single parameter in 
and attempt an approach which will lend itself to the development of the multi-parameter 
estimation problem. Thus although we give an illustrative example we do this to indicate 
what complexity is to be expected in the method and not as a practial illustration, for 
most of these involve at least two parameters. The extension to several parameters involving 
simultaneous estimation is deferred, for there are several difficulties of complication to be 
overcome (see Haldane, 1953). 


general 


for 
2-1. Let P(v,0) be the probability of the variate ж depending on the parameter 0. Е 
the sample and population moments we write 

n (1 а) 

т, = Ут, т = У (v—mjyn, 

j=l 3-і 

10 
и, = «т, и = @т*. au 
terval- 


Tt is assumed that the moments exist and they they are differentiable in some g-in 
It is further assumed that the range of x is independent of 0. 


Ж jon 
2.2. The gth moment estimator 0, (when it exists) satisfies the determinantal equatio 
2 
Мф) = 0, ф,= 0, Ж 
where | 9 m m, .. т | 
| O ж № .. № 
ди (3) 
МӨ) =| 90 а № o Шан, 
ди, 
38 а Маа = flag 


This is an alternative form of the expression (5) given by Shenton (1958, p. 41 1) for Ше б 
of a single parameter. It may be derived by applying Schweins's theorem in determino" " 
(see, for example, Aitken (1946) or Muir (1906)) to the series expansion (truncated) 
moment estimator appearing in (13), p. 112, of my 1950 paper. Tt will be noted that with 


for 
the 
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onsists of elements which are functions of 0. It is 


exception of the first row in (3), Мб) с 
w in terms of cofactors as follows: 


convenient to expand A(0) by its first ro 


қо) = $ mAlO) (4a) 
= (m). (40) 


) we have introduced a notation for the vector 


Е or brevity Mis written for Al (0), and in (4b 
inner product of the vectors ™ = [Mo ту, сет and M = m will be 
noted that m has 4 random components, for m = 1. 


tic q-dimensional Taylor expansion for 0, about 
(5) 


er the evaluation of the inner-product operator 


I 2-3. From (2) we can write the stochas 
the point д, and in fact 
ji, and in fac в, = {exp (А(2|0т)) 0, | тер А-М» 


Whore M = m — jt. A an arbitrary vector. Aft 
in (5) we have " E 
0,= > DM, 0), (6a) 

== 
s! p, = (M Jom) 0, (6b) 


where 
valuated at m = /: this b 


vatives of 0, are to bee eing implied in 


and in (60) the partial deri 
ample. 


the barred derivative symbol. For ex 


20, mu +M 9%, 


Ф. = МА — 
Ps them, дт 4бта 
Я 2 
wł 90, _ 90, , etc. 
here = 
дт, On |т=л 


(2) partially with respect to Ms we have 


2-4, Expression for Фо Differentiating 
26 Ф) gp, = 0 6- TTL (7) 
" am, 09 NEA m 
rom which Met hm, = 0, 
d on (9) 
-- АН 
Where from (3) һ- gO = -<4% 
| я (20) 
We thus have Ф, = Rond im " di ке Me 
i as oP, „вз . 
То find an expression for Ф V? require terms ке can be avoided. and indeed 
ав (7) by dotes differentiation. This piecemeal procedure 
1 
rom (7), for arbitrary A and) / A A -AMY (11) 
2 om 
TW: 2 2 уа 
80 that (4 L еа A --(4 A) i 
AL OP S om / 49. 
- -(4 2) < дт 
Аы: | | 
is independent of m. Thus 2 ane} ЕРТ? (а “ту (а а " (12) 
CUNT) = 29 m 
Biom. 46 


CY ау А nh ® 
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But "о = (4 86 N hlg) 40% 
(45, дт, em/ дф? +09 0ф 


so that substituting in (12) leads to 


AS Аш) I m) cwm 000 


and on putting А = М and m = p, and using (8) we have 


21D, = -pon TE p Ot (ar. 58 (14) 


in which, after differentiating (3) twice 


> A OM ди ( qa) (15) 
КЕТТЕР) 


Тһе expressions (10) and (14) give the first two terms of the Taylor expansion for бр көм 
they involve only terms in the sample moment deviations m; — 44, etc., and the populus 
parameter 0. Expressions for ®, and higher terms may be found in a similar way. Thus va 
Ф; we operate on (13) with Aem) у, replace А by М and m by ji. We thus arrive ab th 
following expressions: 


310, = = ЗҢ ayes T м“ =) 


i 
6a) 
gU) (му өмі Я e aMMy (м A) (1 


4t @, D d 473 T P RCM acy (aft 
* 
) aM 
ye 9 саша -E ме» Qs P) 
45 2 6b) 
н UM) (и ud +g ОУ (и us » ( M pur » (1 
т = дд OM и @ 
idi 2 Є se) (4 ая) 
a e " (160) 
98 № 
т OM ди OM Ou OM dy diu 
la 003 2) e 35) -*( 20 а (4 a) 
2%4 (16d) 
= DA 


Be А iatelY 
It is of interest to note that the expressions in (9), (15), (16c) and (16d) follow immedis 
from the identity (Aj) = 0. 
ев” 
3-1. Expected values of combinations of the Ф’з. We require the expected value of expt 
sions such as (AMY (BM .... Now for arbitrary А 


т (17) 


2 3 
 еруМАЛ = [oe a gu 
where Vs = EKAS, X= ар, (r—0,1,...,q). 
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Thus (MAP = yan, 
&(M AY = van, 


«(МАУ- yan 3( — Пт, (18) 
«(МАУ = фута 10(0 — 1) Vans 
««МАУ- yrgln® ( (а-1) (105 -- 153a) /m? 5: 15(n— 1) (n- 2) ynë. 
Moreover, it is readily seen that 
y, = EEA hF 
= (Ap) — ( Ша an*( | t an - -- — 
(19) 


+ е É _ ) (Ap)? (Apr? + D (8 — 14Apy. 


in which A шу" is a sy ymbolic multinomial ago to be interpreted by replacing a term 


suc 
h as рди! by pian For example, when 4 = 2, 


(Ap? = А+ А +2445 t 24,414: 


+ AR fy + +24 Азиз + 


9. 
32. Derived expr essions. To ev raluate an expression such as 


G6 (LA) (MB) MO) 


We merely express this as “(с ai) (в zi) (МАУ, 


rs independent of m and 0. Thus from 


A, В, С being arbitrary — 
„garay = da? 40 - 
we derive n&(MA) (MB) = (AU) (By) - (4 (Ви). A 
For example nE l M) (BM) = (Mp) < Bp): йү» 
OM дш 
— Mi EA. d je)" == (21%) 
Ssa А = [ito Me A 
Similarly for third-order terms W° find 
“ Bp) (09 7 -— Mer aed АА 
м й (28а) 
M (70 (Me. 
ШМ? 0 ыу” ЖУ y^ et n) 
ч 2.4 у= xM м (бу D: (594) 
мы) = (Mp) 38" i 


те M) (я 


300 


Again, without going into 


weny (7 ӘРЕ ІЗДЕ) сарв) ar Gl): 


(240) 
4-1. Moments of 0,. To work out the third and fourth standardized cumulants y; = Ka] Kà» 
у» = Kalk} to order n and n-!, respectively, we require the expectations of the following 
Tomna: Linear Ф,: 
Quadratic Ф, Ф.Ф, Ф, Ф,Ф;; 
Cubic Qi. ФіФ. 
Quartic Qj dio, dio; ФТФ). 


Omitting the algebra, we quote some of the results in Table 1. 


Symbol 


2 
Фї Ф, 


qi 


Ф Ф, 


oio 
(m this table № = 


42. The moments 
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details, it will be found that 


пзе My = 3(n—1) (AS Ив, (24а) 


ТаЫе 1 


Expected value 


A ён 4 
чт ZEY +O(n-*) 


Anh). 


AY gilt T " 
СТЫ va 20% т) 


САУЛ "hy OM ер 3 
21273 a %0 20 +O(n-4) 
3(n—1) A? (р) 
ant ME 
-6 у= [в ды e 18 ам ап 
5 AM- 
nh} 20 20 | 


-fak Muy? МАЧ S. (Cu)? € 2 4-O(n-*) 
A3 ІЗ en a | 
эй! % «<р + O(n 7) 

ЖАСТАҒЫ 
e 20 DI pi Di Te Э) ) 


(0, -60,)" 


##(0„) = 


are now evaluated and it appears that 


в 
n3 (0) = "3 


: e ВАА (ar + gael у / деи 
СЕД en? 50 70) “© 508 


ыш» ЗАЗ, лз “Жу, а” 


= ә 5 1) 
—2hy May (S ah) brin v3), ү 
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uns oo MES ЗА? » 
Ш0)-- p? s C n 4 0(n9), 
(258) 


= ch vu 
ws nhà 00% 


ui.) = an-A, (Muy ACH А 
404) ЕН L3 ж T мые) +24 era | 


E 02 a 70 Я uà 
=ч быу io e- ғау (% 2) 
a aM стих JOM 
ГЕН Т 200 20° y Car слу + Aa =) 2а 1005) (256) 


4:3. А 
We now find as first approximations to the standardized cumulants 


3A? (к TEY- ША 
Ylha) ~ TAi (26a) 


PE: ys CADA „ы ди ad, 12% 
704) ~ див Ай e any 2020/] М 


oan 7, Pest 19А add Ot 
(a УЖ #0 M) i nC s 205 


з М 20 

12А OM 0 и 12 4 
“G 2 — 2% 265 
ae ЕЛ a) was p a" DM 

w 
Fas a clear indication of asymptotic normality- Tf P(x, 0) is linear in 0 (a case which would 
Е дақ for a Gram—Charlier distribution, the par rameter governing the corrective term) there is 
nsiderable simplification in the moments, and indeed 

(27а) 


VAU ) = 040007), 
1а AY -2 21 20) (Ми) EN es Ж 4 AR] ‚ (276) 


(01) ~ 
пй P 
e -e€ d и” 3 
m ~ NOS pos 
OM 
aM дп 4 2€ ) 
«Ми» _ eer Car t) 1 L Esc > 49. (27d) 
пу (0) “Rape — “АН 1 АМ 
It is T. hes when tl 
of der term. in @(@„) vanishes when the pro- 
some interest to observe that the firs en (1956) à T eis property perci 
e sed expect this to bo the 


abi 
lity is linear in 0; accordi ane t lineari 
nearly: 
- ә...) under certain 


Maxj 
ena likelihood estimators in the ¢ wo (ө.4- й 
Si р ors О жн 
nee (27a) holds for cam na that estimators wi nd this property should be 
as sugg › 


regu] 

e | conditions. Haldane 

ibed as being almost unbiased- . 
i joment estim 


5. there 18 а n 
Maxi Maximum likelihood estimators: : he case 107 inite g, and il 
mum likelihood solution (this may e tho ^ 


ator which gives the 
| any event for q > © 
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under certain limitations), then the expressions in (25)-(27) can be given in terms of the 
expected values of derivatives of log P(x, 0). Thus (using = to indicate correspondence) 


AS da bat EN (28a) 
ECC + (а). 


ey ‚ ,10POP (280) 
(4 У A = piss opi: 
BM N? Q/0N 18P A (OP A®P\? (29) 
U 6 ^ + > 
00 “990 Peo PO) | Poe 
OM OUN . GA ,(dlogP\? | , è log P? 21 aP P (30) 
= 20/ ^ aél 77 | +54 20 ) ШЕТТЕГІ 
amen 0N p 1 PZP , 1 (aP\? 2P "e a) (31) 
e zy 5-0  р+00 20% as (55) 00: А ра (age : 
2.4 дА [106 P\? | .,(dlogP\4_ Аз AY ae] (32) 
T eos E йе [I 3 5 И o 
«679 wel 5 al 20 ) S 77 ) zla ae 
The derivation of most of these is straightforward, but for (28) we note that 
dlogP . 
30 ЮМ 
where (MX) = ug am ul oa +..., 


and (29) is derived by differentiating (27а), and (30) from (275). For the sampling moments 


of the maximum likelihood estimator 0 we therefore find (after some non-trivial algobra) 
J (+569) (33а) 
Ре P? 00 20° E 
@(0) = 0— —wun “Ой ), 


„a т-1,.„овР\а 1 [әруз әр 12РӘР| ү, 
040) = зт -4( 00 ) Po) 20% тез as] e^ 


á ( | 24 aam) | e CX jen +0079), (330) 


2\” Р=00 00? 20 
x45 = (2) —зе(тазе ту )|/ Yero». (339) 
пе аи 
(овая) ] вО). (88 
where es (y. 


These expressions agree exactly (allowing for notational differences) with those given in (6)0 
Haldane & Smith (1956, p. 100). It is to be remarked that the forms given are not necesh ay} 
the simplest, and that there are alternatives for such terms as &{(1/P?) (2Р |00) (PI we 
in terms of the expected value of powers of (2 log Р|20) or their derivatives (in this conne 
reference may be made to Bartlett (1953) who gives an ingenious method of deriving S" 


identities). 
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6. Illustrative example Ir henton 958 the as totic variances ar egiven for he f 
ITS 


three moment estimators of 0 in 
Using tl p,-e(2-04(0—1)2) (0<2<0,1<0<2) 
ng the results in (27) and (33) w | 
ie s (2 ) we find for y, and уз for the first t i 
the maximum likelihood estimator the following: aver TIED 
уд) ~ OP- 1208+ 248 — 1201 0440-9), 
туі) ~ (7 00 + 4903 — 14402 + 1920 — 84)J( — 2+ 40— i E 


260663 + 14760? — 3000 — 16) (а), 


Vy (0,) ~ (12809 — 86405 + 21840* 
ту» (0, ~ ( — 614409 + 59,0040? — 243,45007 + 68,8960" — 901,04405--1,064,7060% (340) 
^" _ 934,5120? + 558, 1440* — 194,9280--29,340)/(о%/?), 
is а= 40—3. p = cap 199 - 120 395 
үл yi03) ^ pl (ar? | (840) 


туз(0з) ^ p' Kg? 
0 — 411,28009 T 689,4720° 


wher 
ere p= 675009 — 51,3000* + 180,504 
411,84062 + 131,7120 — 18,304, 


_ 854,4000* + 138,0880? — 
g= ВА в т=—15#+ 5663 — 480? + 8, 
284,318, 100077 
_ 15,171,819,9 
1,281,505,0240* + 5 


4. 1,478,232,8200" 
6008 + 137,243,452,80007 
2,015,042,5600% 


p' = —4,556,2500" + 52,245,000" — 
2,814,2240% 


= 7,531,774,3800" + 28,09 
= 178,312,054,8480* + 166,520,415,2040" — 11 
— 16,167,313,1520? + 9, 002,741,7600 — 252,062,208; 
Jn y) ~ (e+ 1 % 4)1-@+ 12/9315. 
о – (a log (1/2)? (0 = 2 арргох.), 
пу) = 3 (6€ DA — (ag U*( 
es (Bo? log (120) * 


a 4-62 — 1)a3/ (2D) (paa) 


where e gaspi У 
EE i 
Tt will «ation of the sampling distributions of 
be fairly evi 34) that noe 

y evident from (34) i d 

ле estimators such а8 0, (or higher 07 s derable undertaking. An 

pression ; 4 this exam is given In 3 : 

order E rc (a e variances in Fig. 1. Ав for y, and У», the 

ators is evident irom i likelihood estimator is 

hi is 0 = 2, forin this case the 


а; 
“Pproach of the second and Tue of the parame 
e 
. estimator diverge. 


О: ments О: Di x Қ 
1505 е first two standardized cumulants 
nt estimator. As a special case the 


7 

м 1 Conclusion. Expre 

ma nd y, of the sampling greeing with those of Haldane 
ximum likelihood estimator isincludec: 


Smith (1956). 


in the construction of these diagrams. 
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> Maximum likelihood 


estimator 


4 12 13 14 15 16 17 18 19 20 
Population value of 9 


Fig. 1. Asymptotic variance of estimators of 0 in P, = e-*(2—0-- (0— 1)a)- 
20 


vn у((0) 


Maximum likelihood 


"x 


И gp]. qp. 


Population value of 0 
Fig. 2. ./ny,(0) for several estimators in P, = e-*(2—0--(0— 1) x}. 


2р2 


ae oy 


Maximum 
likelihood 


-02 it | ЕЙ: БЕ: 5 = 
Population value of 6 
Fig. 3. ny(0) for several estimators in P, = e-*(2 —0-- (0— 1)). 
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EFFICIENT ESTIMATION OF PARAMETERS IN MOVING- 
AVERAGE MODELS 


Ву 7. DURBIN 
Research Techniques Division, London School of Economics 


1. INTRODUCTION 

Although the moving-average process is one of the basic models of time-series analysis, 
efforts at efficient estimation of its parameters have not been very successful owing to the 
intractability of the maximum likelihood equations. In this paper а simple estimation 
procedure is suggested which in large samples gives estimates whose efficiency is as close 
to unity as is desired. The limiting variance matrix of the estimates is evaluated explicitly in 
terms of the coefficients of the model. The idea underlying the method is to fit а high-order 
autoregressive scheme and to base the estimates of the moving-average parameters on the 
fitted autoregression coefficients. Reference may be made to Wold (1938) and Whittle 
(1951, 1953) for earlier work on the problem. 


2. MAXIMUM LIKELIHOOD APPLIED TO THE FIRST-ORDER MODEL 
Suppose we have a stationary first-order moving-average model 
я, = вре. (t=1,...,n), (1) 
where (6) is a series of independent normal variates with zero mean and variance 0%, and 
where || < 1. The variance matrix of 2,,..., p is 0?И, where 


14-5? ГА 0 Ме? 0 
р ї+# B : 
V-| 0 В 14102 : 
E LEN. 
0 та ay В 14% 


Using a result due to Dixon (1944) the determinant of this is [| = (1— posta -£ a 
which tends to 1/(1— /?) for large n. The inverse of V, is approximately 
1 -B 2 ... (-gna 
-£ т e з 


1 
тж] ё =i : 
; 2% —f 
(eS uo s -£ 1 Р 


Thus the likelihood is approximately given by 
= 2 1 VAMOS 
log L = constant — } log (1—/2)— доз ур) 7 Т — Bey ay + BPE 1 


Since the term }log (1—4?) is of small order in n compared with the remaining part $ 
log L we may neglect it to give the approximate maximum likelihood equation 


8 Т 
ap — = rab — 2872,0, + Зал, | =0 (2) 


| 
| 
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cf. Whi 5 i 
( hittle, 1951, equation 7:523). On attempting to perform the differentiation, however. 


wi ; 
pa ourselves with an unmanageable estimating equation. 
he other hand, the simple estimate obtained by equating the theoretical value of 


the firs А Е 
e first autocorrelation coefficient p, = 2/(1+ 2°) to its empirical counterpart 
ry = плаца (п — 1) Хай) 


for Ё = 1 Whittle (1953) calculated its 


is known to be extremely inefficient. For example, 
$ of the maximum likelihood estimate. Whittle 


E oes variance to be 3-8 times tha 
iis ngly suggests an adjustment procedure intended to bring the value closer to the 
ximum likelihood value but this appears to be rather complicated. 

In the next section a simple but efficient alternative method is suggested. 


PRESENTATION 
1) has the infinite auto- 


(3) 


3. ESTIMATION BASED ON THE AUTOREGRESSIVE RE 


It is well known (see, for example, Wold, 1938) that the model ( 


| Геот 1 $ 
Bressive representation 
-6 
p 


ay otia + Osea T n 

er k+ 1 terms of the series t+ &{ tı- +... is 

( — )Е*! (а-а-а — fugat: ..) = CT EF Gk 

n has variance 2+2. This > 0 rapidly as k > © since || < 1. Consequently the finite 
presentation дұғада tn + ty Xp = & (4) 

by taking Ё sufficie 


large, we shall alw: 


Whe , 1 s 
re a = ( — Ji. The remainder aft 


ntly large. However, itis important 


can 
be made as accurate as we please Ў à 
aysin asymptotic arguments regard 


t 

pee that although Lis taken to be 
» small compared with n. 
aet c. ..., a, be the least-squ 


i.e. the estimators obtained by 
s of Mann & Wald (1943) we 


а, and variance matrix 
ҙа, have the 


ares estimators of ад, Oe 


Dinimizi ы "ES t 

пишите Y (аад: 7 9 . From the resul 
i 

ically nor 

matrix of zy» -* 


mal with means ær > 
Consequently, d, -- 


1» vey 0р ATO asymptot 
se 


zi 
k/n, where о?у, is the variance 
pts k 
Pptotic distribution 


i k = JE dap (5 
ар И заж 2,077 - pda (5) 


k-l 
PER 2p Ў (a; — di) (i 
ia 


(2л)%® TT 
; қ isfy the relations 
Bince 94, ..., hy, аге autoregression coefficients they satisfy 
m g 
=-6 
000 qao toit ы 
= — 02 
аб +оо е ROE 2 
Ез: g? and c, = 0 ("> 1), we obtain 


Whero ос, = JE (atis) Putting б = (1 +P") оза” А 
(1+8) +8 7 Rod IN 
fo, Porte 3 н E 
paratt f?) “к й 


,, b— 1) 
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Multiplying these equations by —2а,+ д (i = 1,..., k) in turn and adding, we get for the 
quadratic expression, Q say, in the exponent of (5), 
k k-1 
Q = (1482) У («;— o +28 У (а,— ti) (Giza 7 жы) 
ізі i=l 
с k-1 


ai 2f Y, ағара За, — Вол. 
1 il 


Me 


= (1+ 6%) 


| 


Since, for large k, ә) is nearly equal to — / this gives, on putting a = 1. 


к. к-1 (6) 
Q = (1+/°) Жа эр Х ааа 1. 

to а high degree of accuracy. jà i tribution 

We proceed to estimate // by maximizing the likelihood obtained from the distribu 1 : 

of dy, ..., а. Asin $2, |V,| = (1 — £2**2)/(1 — 2) which for sufficiently large k is approximate - 
equal to 1/(1— £2). This, however, is O(1) whereas the exponent of (5) is O(n). Consequen a 
toa first approximation maximizing the likelihood is equivalent to minimizing the quadr ie or 
form Q. Differentiating Q with respect to / and equating to zero we have for the estima 


of f, 


a " 


. | . : " : 1 od 
This estimator is manifestly much simpler to work with than the maximum likeliho 


k 4 
| | к-1 y i 
estimator obtained from (2). As n increases it converges in probability to — № ча / Ж : 
| і-0 . "ge. 
Where Oy = L and this may be made as close to В as we please by taking k sufficiently lar аг: 
Since there is one term more in the denominator of (7) than in the numerator there p 
| | к-і У аў 
haps, something to be said for dropping one of these terms, i.e. taking b* = — atu = | 
. . "Ley H T n | 
аз the estimator. This possibility will not, however, be pursued here. A further refiner у 
would be to take account of the determinant |V.| in (5). This would give the estimatin® 
equation i 


ГА 


"а-)97 (Уай + Eaa) = 0 


which can be solved iteratively. However jigible 
. ‚ the effi ^ i negli 
nier effect of the extra term is generally nes 


4. EFFICIENCY or THE ESTIMATOR 
The minimum asymptotic variance of consi: 


by 
А Я stent estimators of 3 has been evaluated 
Whittle (1953) as (1— /?)/n. Whittle also shows that this mini 3 ined by the max} 
mum likelihood estimat i 5 | мии ЕН i 


e. 
leas " jently 1878 
argument similar to that u pens, анық IMET the 


dP = (1 = pt 


ii | 8) 
(уы — €xp(—inQ) da, ... dap, | 
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wh is gi 
here Q is given by (6). Let us write this in the form 
ар = (1 - #37) йа. 
а i à 
nd let us write the integral of this with respect to (1, --- ay аз 


(а — go-*f(Q9)da = 1. 
T : 
ІСІМ 


Di -— 
ifferentiating (9) and dividing through by n we have 


[ioe -о(). i.e. в] =о(,). (10) 


бп а "T | 
and differentiating agam. 


|е = (5 СШ -0(9. ie. (=) 2589) of): a1) 


opp 2e 
Now 20 _ ЕЗ k-1 220 _ Q5 a 
в = 2р әш 242 P ауаға and ge 2 Xd 
Usi 
sing these results together with (7), we find 
20120) 
p-p-- 
B= opp" 
80 that V(b) = ке apy 
x i? 


(20/0831 to the first order, and 


tc қ 2; 
о the first order in №. Observing that BERAE =. 
totic result 


usi 
sing (11) we have the asymp 
2 
VO) = ақа”) 
arge Ё tends to 2 Pi ze 40-05. 
ne 


T 2n k o 
© the first order in 2. (zs) m x ой, У which for 1 


T ; | | | 
hus for sufficiently large k the asymptotic variance of bis 
1-5 е 
7(b) = ig я 
V(b) 3 
азс 
losely as we please. 
5. нтанЕн-ОВРЕВ pROCESS 
The extension to higher- -order processes follows along similar lines. * Suppose the x satia 
13 
X= e fie pee tBu (13) 
alt f qr дез +B E сай, 
hs f sof the equation , 
а ere the е? are as before and where thet? t ipe nes d td дема 
E modulus less than 006: 3) can be à 
he finite autoregressive process 
x, 00A NT = êr 
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i f a, ... 0 0 
As before the asymptotic distribution of the least-squares estimates а),....а, of o i 
As befo: 
i ximately for large k, 
TUM i - nt Бр eb (— 3nQ) da, ... da,, 

(Әл) x | 
i atrix 
x "the variance ma 

where, writing а, æ for the vectors {a}, ..., ау, (8, ..., ж} and a?B for th 
алды Q = (а-а) Б(а-а) 


= а Ba— 2a' Ba + a! Ba. 
;here 
А P : : в = 0, wl 
Since а is a vector of autoregression coefficients it satisfies the equation Ba + 
€ = (6. ..., Cp} and gc; = (аа. ,). Thus 
Q = а Ва+ 2a'c — a'c, 
h 
where a'c 4 1-- Y fit is the constant term in the expansion of 
ізі 


h | 
(1+012+... Жара) Y) ci. 


i=-h 


Now x c; = (1+ а-ы... zh) (1 EM. fuh) 
and ls 1+012+... А = (14+ 2+... +82) 
to any required degree of accuracy by takin 
80 that 


h 
> 00 = 1, 
g k sufficiently large. Thus ac + 1+ Lf i 
^ T 
Q = а Ва+ 2a'c 4- Ef. 


ізі 


Ав for the first-order 


process log | В| 
and may therefore be 


ith me 
is found to be of small order in n compared wt 
neglected in de: 


lete- 
riving estimating equations for the Рв. For шқ the 
ness, however, the evaluation of |B| was investigated for a second-order process a rge Ё 
Appendix it is shown to have the limiting value UEA — fy)? {(1 + Ba)? В for ree i 
Neglecting ІЗІ, on differentiating Q with respect to P; ..., An and equating the deriv tions 
to zero we obtain the estimators b,, ..., б, of f, < Ёз ав the solution of the linear equa 
k k—h4l 


2i jd, | 


k-2 
: 
id аға, 
imo oo d 5, UM (келей 


M 
a 
‘a 
e 
= 


i=0 
" 
2 Е b У aiti |, (15) 
i-o А % i=0 
k k—h 
x QOL. Xa b, У а; ern 
i=0 =0 г і-0 


The asymptotic variance matrix of by, ...,8 


| 22 -1 ich 50 
һ is approximately „| ғы | incu 
the first order in n is [7 т, w 


ор, о); 


т 
here 


k-i 
" - k-h41 
2,4 299.1 . 


= ке E OO uc 
: iz 
Gat H “ы 
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H k-s 
ere, PLE +s Ís the coefficient of 25 in the expansion of 
(140324 .-- a zl) (1-05 27 + -+ aptos 


he coefficient of 2° in the expansion of 


уа ЙМ. 
in the autoregressive series generated by 


which for large k is nearly equal to t 
(1+fye+-- +f, 


This is equal to the covariance of Yı and yrs 


yit Éet -- T fus = Ep 
hus U-1is the variance matrix of А successive 
be obtained by means of the following 


(16) 


Where (£3 has zero mean and unit variance. T 
о : s н 
bservations of the series (16). Its inverse, U, may 


considerati 
ions. 
NAE Y, the vector Ун» 


Let Y, denote the (column) vector 01». | 
1 the conditional density of Уз given Y, are 


sls). The un- 


conditional density of У, anc 
E Yn)” 
K,exp(—1YiUM) and ТЕ 


respectively, where Ky and K, are suitable constants. Thus the unconditional density of 


UE 2h 
уру, y -— Exil. 17 
K,K,exp ЕН ҚАЗЫЛДЫ + Paden) | (17) 
ditional density of Y, given Y, are, since a stationary 


and the con 


regard to n along the time axis, 
o 


Similarly, the density of Ys 


сынық ali of Ya mate 
me series is symmetric with directio 


4 І У ET н. 
К,ехр(= #121029) and K,exP | =5 PEL m 
Thus the unconditional density of yy m 8 


p+ 730%, |. 18 
[д6 ) ұшы) (18) 
2 (t=1 


e density. Equating the first 
(17) and (18) we have 


2 Д 

К, Кзехр +... ВЫ 

{ i i i resent the sam 
(17) and (18) must be identical since they гер eer 


rows and columns of the matrices © the d ) 
1 Ws 1-1 

л ЕЛІ R " М 
: А 7 +й : 


Uy + Bi, а-а-а Ph ant 
2 
et V fiat Bi 


А 14 fi o Pha 


а 


Uin +8, Bit ie 
9) 
n E we find - A Por Мал С” Bii АРА 
— f go n-i : 
S E. : 
U В: — By : 5 % : . (19) 
- Въ Ваа ГА + fafa ае ез x | "WA Въ Bia 
: із nm» 1-8% 


By a Вав, 
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Values of U for h = 1,2, 3 are 


Xd 165 А = 3 


" 2 == В: fla — fifi 
- fi: 1-4}  A-AB]: 1-A А-А» А-А EL 
Р ‘ Post 1—28 | P3 — б» Bc т a 
Въ = В.з fh — Pall 


as made 
The assumption of normality in the distribution of the езіп models (1) and ( "S nes & 
for convenience and can be dropped without much loss. The justification is pesi only 
Wald (1943) in deriving the asymptotically normal distribution of а, cà pex having 
that the е? are independent and identically distributed variables with а distribu Guat 
zero mean and finite moments of all orders. On these wider assumptions the кз an? 
and 6,,...,b, derived above are still consistent in the limit with Шаар a hat we 
variance matrix given by (12) and U/n, where U is given by (19), Бе желдің 5 efficient 
lose by adopting the wider assumptions is the assurance that the estimators a 


f 

ficient in the class 0 
However, it is possible that the estimators could be proved to be efficient in 
estimators based on the sample serial correlat: 


the asymptotic normality of the serial correla 
taken together with Bartlett’s (1946) 
serial correlation coefficients are inde 

The assumption that 
sight. It is customary 
tial indeterminacy ari 


"onm 
ion coefficients. This possibility M те 
tion coefficients of а moving-average а кті 
result that first and second moments of the S^ 
pendent of the parent distribution of the 68. - аб first 
|Р | < 1 in model (1) is less restrictive then might appe дің essen- 
to impose the requirement |A| < 1, since otherwise there id of ob- 
sing from the fact that we cannot distinguish from the stu ations 

8 generated by the model with В = fy and prr w the 
= 1/fy. The indeterminacy disappears if we fo оз 
1, the method proposed breaks down since the autores ns also 
representation does not converge. However, other methods based on series иш is no 
rm of the likelihood given in (2). The po mark 
ce since such cases are hardly likely to arise. Similar i have 
for model (13) that the roots of ahy atit ape uui РА = 


6. REMARKS ON THE ASSUMPTIONS 


It was assumed in $3 that t 
Squares. Many statisticians will 
from the sample serial correlati 


he estimates а, .. 


; however, prefer + 
on coefficients т 


d by 
oak OF ay, ..., or. are calculate alculate 
© work with estimates a4, ..., 47; © 
“ЭТ Using the relations 
, 
a+ та... + тату = 0 


, , 
TiG + Фо... Ет aa, т, est 


Я Ë dca. 
Ti Ty за esed 
Since there is no difference in as 
squares estimates а)... ақ, 
оп (5, ..., d, аз well as for es 


Ymptotie behaviour between ат, ..., аз and t bas 
the asymptotic theory Siven above holds for estimates 
timates based оп aq, ..., ақ. 


һе least" 
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7. TESTS OF SIGNIFICANCE AND TESTS OF FIT 


struction of large-sample tests of significance. 


Тһе above results furnish a basis for the con: 
lculate b from (7) and test 


Thus to test the hypothesis В = Во in (1) we ca. 
z= Jn (0-A) 4-29 

iate. Similarly, to test the hypothesis 2; = Puah. 

degrees of freedom, 


as à standard normal dev. yh) in 
(13) we t АА "X А 
est ш = т У; X u!(b; foi) (6; — Ро) азам variable with № 
" ігі j=l 
p [wii] = U- and U is calculated from fig; ---: Bon using (19). 
test of goodness-of-fit of model (1) is obtained by noting from (5) and (6) that 
k k-1 
пд = " {a +f) X ai 38 E шана 1 
i-o 450 


with k degrees of freedom. It may be verified by sub- 


is approximately distributed on x 
titioned in the form 


S 1 . 
titution from (7) that Q can be par’ 
k ә Е 9 
9 = (1-0) 5-1 4 (b — PY BG: 
i-0 i=0 
sion sum of squares ina 


2 Xaj is equivalent to the regres 
residual sum of squares. 


svmpbatally, the term n(b — Й) t 
near regression model, while the remainder is equivalent to the 
el may be examined by testing 


Thus the goodness-of-fit of the mod 


k 
nQ = na -i) X di- 1 (20) 
i 
ана x? vari j ces of freedom. 
T riable with 6-1 degrees 0 ^ AMT ЕТЕКТІ MÀ 
_ ne test for the general model (13) i$ obtained by Fri niin : 
* inimizing the expression Q defined by (14) with respect to fao n am sia xl A E) 
ог, ..., f, in (14) and using the relations (18) we find for the minim" ? 
k h j (21) 
А өг 56, 2 а.а 1 
9 -Xita e alll - 
; ^ 2 variable with К— de Tees 
The goodness-of-fit of the mo del is tested by treating 10 25 2 y? variable g 
n that suggested by Wold (1949). 


9f freedom. 
Jer to use tha 


This test of fit is clearly 8 good deal simp 


IE NUMERICAL RESULTS 


о twenty series each containing 100 obser- 


P g. SOM 

i t 

tst-order moving-averages have been fitted 
m= tt Зент 


Vati 
tions generated by the mode -— — 
ish Electric 9 


ted on the Eng a 
es. Denoting the 


aleula р 
ates © те fiti to each seri 


Th ; 
e fc 1 devi 
са] {e} were pseudo-random normal d art E s 
i ү 
те culator, Autoregressive MO els of or verage parameter was then estimated by the 
Sulting coefficients by 41 ++"? а; the moving-@ve""° 
Orm: MUT ds 
"mul; Жс. = 
Ы P ажаа 
b= 1+a4+-" 5 Biom. 46 


?0 
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i ion r, was also 
For each series the simple estimator c obtained from the first serial correlation r, w 
calculated. This is given by the formula 


c[(14-c?) = n, 


: : taken 
taking the root having modulus less than one. Where the roots were imaginary c was 
to be one. The results are given in Table 1. 


ТаЫе 1 
Series b c Series b е 
| | аны (а gik кк қ 
1 | 0430 | 0405 n 0-5199 = a 
2 3639 -4966 12 | MEM 
3 3851 | -3509 13 | “5278 Ep 
4 75 | -4579 14 | 4856 Eos 
b 4298 — | 3862 15 | -3141 "3524 
6 0-5466 0-6665 16 0:3380 1 0:2070 
7 -5326 -6106 17 4005 ко 
8 4156 6212 18 +5598 1-0000 
9 4926 1-0000 19 "3993 beer 
10 4142 0-2839 20 :5069 чан 
Тһе sample means and variances are: 
b с 

Mean 0-4531 0-5126 

Variance 0-00536 0-0398 

S.E. of mean 0-0164 0:0446 


. . i x 
The claim for the efficiency of b is well Supported by these results. The observed og 
namely 0-00536, theoretical value of xis {1 — (1/2) = 0:0079 


А ig that 
arlance of с. A further point in favour of b is 


yel 

ween the observed mean of 0-453 lue of $8" 

31 and the true value of 2 ut 

over twice the estimateq standard е ed, р 
rror. The followin es were 01100» 

1n no case was the Improvement satisf, и à 


8: 
1 resul 
Islactory enoug] ts ; ion of the res 
the order of the fitted autoregressive sch Д 153 to warrant inclusion 
1 чода 
at the end of §3; the term — $ log ( ethod 
or аз described at the end of § 3; Quenouille’s ™ 


-0 
Н » 1959, for А is matter 
bias clearly calls for further consideration. 9" details of the method); “Tins 


i Y aft 
of те. : ж та `М. G. Kendall for some valuable corrections to the first ar 
9 Miss J. Grahame and Miss J, May for assistance with the c omputatio 
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APPENDIX 
Evaluation of the limiting covariance determinant of the second-order moving-average process 


Let D, be the nth-order determinant 
а % s 0 О а. 5 0 
babe? 
c b a b 6 
Oo a D g 5 | 


0 o v € 
24/5, = Ait Aiba с = Bs 


е case where @ = 1+0 | 
on the fifth-order difference equation 


| 
| 
| + 
0 
for th 


Ww 
© proceed to calculate lim Dn j 
after some reducti 


Ex ; m> vo obtain 
Xpanding by the first row We ош ж s sop anto ыы =0, (A1) 
=i ioe (ae izes in the form 
Where zr) i pstitution for a, b, € We find that (А 1) factorizes in the fo 
z'D, denotes D,,.,. On subst 252 =0. 
" ВЕ , ye notes sos "m 
(=—1) (0577 zi 
Let gie «GP Ta =» 
T " 
hen C, satisfies the difference equation (8-0) e, = 9. (A3) 
— ge 
2-06 р > 2 provided we take Dy = 1, 
> fied for n > 2 P 
А Al) is satis 
On - hab equation (2 
1 examination it turns Out tha piam 
20, 4 =. Substituting in (A2) we hu 4) 
a 
C= L в y 
т (Ra? а. 
2 - MC A 
2n 
" р 
Th . - Md quent ho 0 has roots with 
© general solution of (А 3) 18 "TI pul pd 0, = Substituting in (А.4) we 
y; d n 
move en specifying the moving-average A < 1. eque! -+0 
dulus less than one. It follows that |» 
fing А, =(1 lim G 
= — -2 = W п" 
Fro Ps) n 1} lim Da Яо ". 


m (A 2) we have that qa aos Pt ДЕ 
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1 
Ша D, = А. 
Consequently, bor (l=)? ( +22)? — £5 


It is remarkable that this limiting value is the exact reciprocal of the very similar determinant 
1 Ay fs 0 0 эй 0 
А 1+0 FO р») By 0 
B. В+) 1+ї+% RA) 3 

"710 B дағ 14+ ЫНЫ 


(А5) 


" ж 2 : | 

$ 1-і Ж, 

0 m vss gi ә Ay 1 
which differs from D, only in the first and last two rows and columns. H, is the determinant of the 
reciprocal of the variance matrix of n successive observations of the autoregressive series 


M+ Руза + ұта 6) 


where e, has unit variance, Thus the limiting covariance determinant of n successive values of & second- 

e € moving-average series is the same as the covariance determinant of n observations of а езеді» 

dira eos regressive series with the same coefficients and the same residual variance. It is likely that 
result is generally true and it would be interesting to see a general proof of it. 
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THE COMPLETE AMALGAMATION INTO BLOCKS, BY WEIGHTED 
MEANS, OF A FINITE SET OF REAL NUMBERS 


By В. E. MILES 


Statistical Laboratory, University of Cambridge 
1. INTRODUCTION AND SUMMARY 

КЕЗЕ discussed below arose in Bartholomew (19594), in which is proposed a test of 

ogeneity of ‘ordered’ alternatives. 

On p. 40 of this paper Bartholomew 
proceeded to determine its values forl<l<k<4 and general a; (i = 1, s E). 

In the special, but important, case of equal weights (@ = а=. = ар) it may readily 
be verified that P(l, k; €p s; +734 МС independent of the common value of the a;, and thus 
may be written Ра, k), say; this notation departs slightly from Bartholomew’s in the use 
of a circumflex, but this is done to avoid confusion in the Appendix. Bartholomew deter- 
mined values of Pi 1) for 1 < l < k < 5 and, on the basis of these results, conj 


ectured on 
D. i 
р. 43 the recurrence relations 


defined the probability P(l, ke: а, 05. са) and 


2<1<1, (e 


зек 
Where Ра, = i (4%) 
We now make a brief survey of the present paper. 


Summary 
the notation and ter- 


d as a set of n real num- 
5 satisfying а certain 
lete amalgama- 
ing weighted 


of Bartholomew (19594), and 


No knowledge is pre-supposed w (1 
tent, In $2an «n-collection ; def 
1 п posi в, called ‘weights 


S differ to some ex 3 
ers, called ‘ordinates’, and” positi тау” of [n]. The ‘comp. 
asi Ф. Ibis then possible to de " нет <plocks’ by decreas 
n the amalgamation E amalgamation process—is coe > Же 
with ti а generalization о process 9/2 the complete amalgamation 1140 
1e aid of an equival defined as the prob- 
be unique 
of [п] yie d = P(k,n), independent 
of the рар“! the P(k,n); in the 
inuous probability 


әу are give 
(10)-(13) are £ ample from а cont 
19594). The generating 


^ b 
амы to Theorem 2 the ordinates ат k with B rtholomew ( 
Istribution, and this provides the final Im e aking f one of these expressions, we 
"пей: ' mined in $5 and, ™ 
ns of P(k, n) are determ $ 

Obtai т > 

in in xa Lus 

$6 the formu "T \stl 


gne have ә different meaning from those used by 
«ебі › we define 
+ Tt should be noted that the ‘weights 


1 
Barthol bee: 
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. eas : а 
S* are the Stirling Numbers of the First Kind. General and limiting diac ied bud 
ke of values, of P(k, n) are given in $ 7. Finally, in $8 two related problems аге disct 
а ta А , Ai finally, 
‘which both yield an identical probability distribution. 


i ix w ve obtain 
The conjecture (*)-(**) is in fact true. This is proved in the Appendix where we o 
the result 


" s" 
Pik, n) = Pik, n) = = |88. 


; : letely 
On account of this, Bartholomew's test in the case of equal weights а; may be comp. 


ы : ib 
specified. Further details arising from this development are included in a second pap 
Bartholomew (19595). 


2. DEFINITIONS AND NOTATION 


ч [PT , sitive 
We consider a set of » real numbers X v- X,—the ‘ordinates’, and a set of n po 
real numbers W, ..., W, —the ‘weights’. 
We define a condition 


А ; the 
@ on the weighted means of the ordinates with respect to 
weights as follows: 


А WX, + + WX, та ы Ед И.Х реа 
| Wort И | Wirit + Woe, . 

for any two ("+ s)-permutations P(Py ++ Peas), p' (pi... Plas) of the integers 1,...,7Һ heng 
I<r,s<n-land2<ris<n, E 

Imposing € in what follows enables us to use strict inequalities (>, <) instead of we a 
ones (>, <). In particular, taking r = s = 1, we see that € requires that X; + X; m 
Le. the X, are n different real numbers; however, % places no corresponding restrict! 
on the W,. 


We define an “n-collection’ 
it by [n]. By an “n-array’ 
—the first row being the 
gives rise to (п)? differe 
(1 € r < n) is meant an 


as а set of n ordinates and n weights satisfying €, and deos 
is meant a 2 x n matrix whose elements constitute an venio us 
ordinates, and the second row the weights, in some order. [n] em 
nt n-arrays—the ‘arrays of [n]’. By an r-subarray of an eps 


] 
7-array obtained by deleting columns of the n-array 80 es = 
in. % holds a fortiori for the elements of subarrays. (Dropping 
ion, array and subarray have 


s obvious meanings.) а 
| We select independently, at random, two permutations lT... Tn), o" (04... Tn) of be? 
integers 1, ...,п. There are (nl)? ways in which we ean do this, all equally probable, and 
о permutation-pair (с, о”) has probability 1/(n!)2 of being selected. 

We call 


9 ae RE 
Swn = Ce . нт) 
the n-array corresponding to our random choice, a ‘random array’ (n.A.) of [n]. 
We write Zi-W dixe 
: = WX Wax, + EE ШЕР ее 
Wi == Wat Woe. + ee + Wy, 


i Zi CRM 
i= Hi (a ‘weighted mean’), 


; i 
and Yjasthe? x 1 matrix = ‚ where 1 


Si<j< 
and that we may write 4 


n. It will be noticed that Yiisa -array 
с,” ta 1 
S55 = (Yl... У»). (1) 
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3. THE COMPLETE AMALGAMATION OF AN ARRAY 


Soe is taken as the array. 
The amalgamation process 4. We operate stage by 


5 avi : в 
tage, having started afresh with an r-array (2 <7 < 1), either the process terminates or 


we finish with a new (r — 1)-array. Thus, after the (n —7)th stage, we have the r-array 


stage on S75, as follows. In each 


(2) 


а no further, the process being said to have terminated. 
Үй-н Үй), where 1 < i <r—1,in S77 such 
this case we choose quite freely апу one of these subarrays and 
ay Yr thereby obtaining the (r— 1)-array S27 We thus 
Sez, 5699 1, + E 87 the process terminating after п Г 
æ is such that Xj, >... > дїї, It should 


ses = (И... Yr") where 7 = 0, 7, = m. 


Tf XO s... > DE Wai prooee 
Otherwise there are one or more 2-subarrays ( 
that Xy+ < ХИ. In 
replace it by the l-subarr 
obtain successively the arrays 

| k-array 89%, 
if S71 is reached. 


stages, where 1 < k < n; the fina 
be noted that the process automatically terminates 
We observe that 8%7 1 has the same basic form as 507,88 written on the right-hand side 
of (2); by (1), 8%” also has this property- Hence, by induction, so do all the Sar (k<s< n). 
his ensures that the process із à valid one and may always be carried out as described. 
We say that the R.A. 8% of [n] has yielded Ё раторы qoi dn this ‘complete 
amalgamation’ (complete amal.) by Žr 
Tt is seen immediately, from the nature of Ay, 


Unique, 


that 875.55 599 are not, in general, 


y first defining an alter- 


Howeve tk 

r, we now prove tha ° 

native am | M. 28 ives a unique complete amal. Then we demonstrate 
algamation pro ne nal. by 27, with the complete amal. by La 


(Тһеоге 2 f 
m 1)theexacte uivalence 0 г ВЕ _ yl. уһ 
| : 4, We take as frst block Уц, where ja c Ap XE 


The amalgamation process 47: г 
ph he process te 
max. xi" and so on. The р 


8 
cond block, where Xi" Teen 
zh , plete amal. of 


e. This we do b 


"mL 
rminates after l 


2,0’ into l blocks by Lo 
Steps, where 1 < 1 < n, giving the com d а Р uhi Н let 
ә а 8 |, of an array by is identical with its unique comp ete 
plete amal. 


Тнковвм 1. Any com 
amal, by sg, 
Proof. We first show, aS 


В с X1, that 
e of Xi, being ma», p 


> xpi 


(3) 
ality 18 excluded i£ T; — 1. 


a consequene 
" 71 
Же Xi, 


А he first inequ 
for all $, j such that 2 € i <, and ie E d 
Or suppose the first parto is OPE (4) 
gi < WiAw 
= (5) 
By definition iz р ер меде contradiction. 
Adding (4) and (5), we obtain Zi, € mix oll Then 
0% suppose më second part of (3) dit ки Ag (6) 
Тыр сына (7) 
af 
By definition ті Хь = k^ another contradiction 
Adding (6 „ыш tp et ae 
) and (7), we 09% d 


The proof of (3) is thus complete j 
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i ible i : dinate of 
immediately follows that no amalgamation is possible in .2/, between қ. = gor 
аем Xj, with one of the type X^*!. Thus 42, is at least as fine" ав VL. ; » жет 
қ ua к 2 to prove conversely that .ғ2; is at least as fine as Z}, i.e. that in а 
re 


Y i k—into 
al. by «7, the [,-аттау (УТ... Үр) cannot be separated into more than one bloc 

amal. by 7, y А 

t blocks, say, where і > 2. 


For, suppose this were possible, these blocks being Үр, LE, Then к 
Xp > x 
by the first inequality of (3). Also, since this is a complete amal. by .Z,. " 
Ж} э» „в Хен, ш 


(8) and (9) together imply that Xj, > X}, a contradiction. lock in the 
Hence the first block in all complete amals. by ZZ, is the same as the first b gr = 
complete amal. by A, We may now similarly prove that the second block has the : 
property. And so on. 
We may define an amalgamation process 47 which is the ‘reverse’ of Z. 


А ra. 
^ ro Х”т-:+1 = min Хп: 
T'heamalgamation process 7. We take as last block Үтт-1+1, where Хт": гур 


ris rocess 
тлі as penultimate block, where Хи" = тіп. Хи, and so on. This pro 
| т-1 ТИЕ тт-а goo" into т 
terminates after m steps, where 1 < m < n, giving the complete ата]. of 5775 
blocks by „22, iatement 
We may prove the theorem resulting from the replacement of .0/ 2 by Z, in the sta 
of Theorem 1 by a similar method, 


à . ocks’: 
Thus we may now speak without ambiguity of the ‘ complete amal. of an array into Wu 
without specifying w 


hich of the processes 27 1: Za or Z, has been employed. 


4. THE PROBABILITIES P(k,n) 
Let * be some condition we may impose on a R.A. of [n]. We define 


Pul, | *) = prob. (R.A. of [n] yields Ё blocks | к.А. satisfies *) 


— n0. of R.A.’s of [n] satisfying * and yielding k blocks 
no. of n.A.'s of uti 


As of [n] satisfying * 
and Tib, n) as the corresponding prob. with no condition on the n.A.'s. PET 
We notice that Ff. n 1%), Fab, п) are only defined for integers Б, n such that 1 < $ 
THEOREM 2. P 


H + гі ten 

(К, n) is independent of the n-collection [n], and thus may be s 
P(k, n). The P(k, n) are determined by 10) 
(у т 


and the three recurrence relations (for n > 9 


) 
Рат) ="—* Pa, n), (1) 
Р(Е,п) = lip. i = (12) 
> " ( m D*&-1) Pd a. 1] {@ at nes) 
and P(n,n) = * Pin 1,n—1). (19) 


Proof. (10) is immediate. 
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orem 2, we proceed by induction. Let 2, denote the pro- 
r, and thus equals Р(/, т)). Let us suppose 
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To complete the proof of The 
position (Б.Г, r) is independent of [r]forl < k < 
that Z, , is true. 


L [o = X * s 
et Хо = max. X, X occur n, in each of the ‘positions? 
і-1,... 


s, with equal probability 1/ 
.. n of a в.д. of [n]. 

such that each of the (n!)* arrays of [n] 
п} е 1-subarray 


matrix) 1,. 
into ‘classes’ a, г» 
а в.д. belongs to a; fis Tye. 


s) 7 LG) 1,..,5; = beh 


(synonymous with columns ofa 
7 йв 
3 Wesubdivide the arrays of [n] 
сгш to one, and only one, class. We say 
m E. 
E: | oceursin position lofthearray. and tof}, Ü-1L. 
i 
where i’  j'}if the 2-subarray ү: "d ) (n= 1,0) 
yg 


| occursinany of the positions (1, 2), (2,3 


Ub bag. (We emphasize that а subarray of an array contains adjacent columns of the 
array.) i 
It is easily seen that the above classes are mutually exclusive, and toget 
(n!)? arrays of [n]. 
) 
The classes g, // have dies 


her contain all the 


ame property if we write 

_ yyy fi 
Sox and f Gig 
“1 si А all admissible 
| ji 


гє 
We now consider these classes individually. 
“al will obviously be the first block. If in each 


]-subarray (қ 
- one of the arrays of 


ting the ordinate Xt and the weight 


he corresponding member of 24: 
s. But our inductive 


о. In a complete amal. the 
Ач ога, we delete this subarray from position 1, 
р (n — 1)-collection [n — Hr obtained from [n] by omit A 
ie Hence, if an array of [n — Че: «olds k— 1 blocks: pene — 
nen k blocks; each member 9 as 
ponens applies to [n — He 
ence, for n > 2, г. 


we obtain every 


0 


Pa (Lale © а) = 
8 5 | 
ала B.E. n | RA € а) = D, 1,n—1) a 
[nU : p(k-1 n-l) CS ken). 
7 i p. (pA. € tp | R4 € 9 
А D [n | ®-^- c ay) pro 
Be Bn [na ea) Lp | 
TTE 
Hence, using (14) and (15), we pave for n 2 И 
a (b? | R.A: € а) = 
У „| В.А. € a) 
4 R a) р(Е= 1% 1) 5 (к.А. E€ 01 | 
йд 00) EN р | 
` | 1,% 1) @<Ё<" a7) 
Sih Lite u 
м : раша 5% to form the 
ih „фу amalgamate the 2 LH. 
^r. Since X, < Ха» we May y i ста complete amal. Moreover, 
| ptain ? plocksin & 
s of the 


15 
Ubarray (® X; + Wp Хо қ Thus we cannot 


W+ W; 
n this way оп а 


she array 
e me mbers of Ё: „у WO obtam every one of t 
g^ he 
mer 


Derat; 
rating i 1 th 
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ч ituti Хх, +W; Xo) for X; an 
lection [n — 1] obtained from [n] by substituting (W, X; + aue n Н ы, 

“Ж сг +W,) for W, and W,, respectively. Hence, if an array of [n = 1] y ne 
pn pem Lada se da member of £7. also yields k blocks. But our inductive hy 
then the ) g 
applies to [n — 1). 

Hence, for т > 2. 


Fi n | BA. € Bhi) = By рт 1) 


(18) 
=P(k.n-1) (» «kb&n-l) 


(19) 
and Fin. n | R.A. € p4p) = 0. 
But 


и j acp) 
Вып [вле р) = SEE Rukn | в.д. є) prob. (Ra. є fj; |n. €) 
" all admissible 

27,1 


Hence, using (18) and (19), we have for n > 2 


Pulk, m в.л. €f)=P(k,n-1) ХХХ prob. (R.A. є 2... | R.A. € 2) 


ible 20) 
E rc = P(k,n—-1) (l<k<n-}), “ 
(2 
апа В.(т,т| R.A. € р) = 0. 
Now д 
ге) 
Tum) = Bi, n | к.А. € а) prob. (R.A. € a) - Fi (E, n | R.A. € р) prob. (В.А 
n—1 
But prob. (R.A. € a) = 1 апа prob. (R.A. € р) = —-—. 


n 
Hence, using (16), (17), (20) and (21), we have 


1 т 1 а. 
Fal) = 0.7 ЕРІН ПА, а 
1 n—1 1 г 
Foil.) = Pü- 1,n—1)- + P(kyn—1)2=! (2<k<n- ^ 
n 
_ (24) 
апа Вып, п) = Ритой 
т n g 38 
$ 21 
Thus we see from (22)-(24) that the truth 042, ; implies the truth of Pn eges the 
true. Hence, by induction, Z, is true for all positive integers r, and so we may de 
suffix [n] in (22)-(24), 


thus giving us the recurrence 


е relations (11)-(13). 
CoROLLARY. АП the above theory, including T 


ious 
ith obviow 
heorems 1 and 2, applies (with © jg the 
the array S, = (* ls м , where 2% Б 
n — 


Lows 1 pilitY 
ny eontinuous proba 


appropriate modification) to the amalgamation of 
ith member of an ordered, i.e. Successive, sample of size n from a 
distribution. 

Proof. This is because 

(а) for ordinates X г chosen as a finite sample from 
and weights W, chosen as any set of positive rea] 


W, =... = W, = 1), the condition @ holds with pr 
members of the sample, but not their order of oce 
orderings is equally probable to occur 


оў 
- E робо 
а continuous probability distr " ат, 88 
numbers (and hence, in pan the 
obability 1; and (b) if we аге 81 


кшй. possibl? 
urrence, then any one of their т! 


В. E. MES 323 


t to prove the conjectured recurrence relations 
f the Corollary. It should be observed that the 
eights, the use of ‘generalized’ 
ould have sufficed in the fore- 
together with the 


This work was actuated by an attemp 
(11)-(13) of Theorem 2 for the array 6, 0 
induction argument cannot be carried thro 
weights being essential. In fact, to prove the Corollary, it w 
going theory to take the weights W, as any set of positive integers which, 


same ordinates. satisfied ©. 


ugh using unit w 


ASSOCIATED WITH THE Р(Е,т) 


G FUNCTIONS 
As usual, the operations below are merely formal, i.e. they give results which may after 


wards be verified: see, for example. Riordan (1958, pp. 19-20). 
We define the generating functions 2), Ө). and H(z.!) as follows: 


5. THE GENERATIN 


Fz) = Y PE m)". 


n=k 


n k 
вй = X, PU m 
kel 


p PEPPER ad 
n-lk-l 
S 2 
= УВ! (25) 
k=1 
У 26 
= У 0.002" (26) 
n-l 
п ing we obtain 
Multiplying (11) by t2". (12) by (ал, (13) by "z^, and summing 
| Жы а 
да 1-2 
і tant. 
the Solution of which is H = са -zy^ phere ide. Dr 
Expanding the right-hand side as а þinomia i Р 
? i eem Due (27) 
Н- орн gcb o wl um 
P X Pk n) 1, or G,) = 1. (29) 
e ; 4) = 1, x 
на, Hc i= (1-2) E 
‚ Бу (96)-(95 ELA , 
у (26)-(28). C ae eee o 
ads 6,0 = »i 
By (29) H = exp [п ( E 
tial series; 
D Р ‚ ыште a8 80 € ропе! 5 
"ending th igen UI а iem 
C | 
H- „о К” k! 
" 32 
разца (25) and (31), it 9098 бай goja. (32) 
ЕД9) kl ih respectively- 
rmine p.m for fix 
e 


(80) and (32) enable us to det 
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6. THE RELATIONSHIP ОҒ THE Р(Ь,п) то THE STIRLING 
NUMBERS or THE First KIND, 5% 


Y Sp = = 1)... (t-n4- 1), 
The S* are defined by X SU = Е 1)... (1—24 1) 
from which it follows, by substituting —t for t, that 
X [S5 | t = t(t+ 1) (E43 n— 1). 
k=1 


n КЕНТ)... (t - 8 — 1) 
But, by (30) X Pik, п) = Ee —. 
k=1 Ж 
1 ; 
Непсе P(k.n) = n |85, 
Білсе Stirling Numbers are treated r: 


content ourselves with noting 
the P(k, n) 


түт -¢ shall 
ather exhaustively іп Jordan (1947, Ch. IV) ме 


доп with 
s А “йшй exion V 

a few of their more important properties in conn: 

» giving references where necessary. 


7. PROPERTIES OF THE P(k,n) 
The general solution of equations (10 
they can be solved easily; 


; ne cases 
)-(13) is unknown (Jordan, p. 143), but in son 


| В ET 
since there is only one permutation ø such that X,, > -- 


Ма-а, 
uc LO ME В 
S(n—4)* t mE 
and so on (Jordan, p. 149). dis ез 


We give a table of т! P(k,n) for 1 < k <n<12 (Jordan, р. 144). 


Тһе mean I^ and the variance 9? of the number of blocks yielded 


n 
3 ЕР, т) = 6,1). 
But log G, (t) = logt4... +105 (t4. 1)—log (n!). 
Thus 6,4) _ 1 1 (33) 
9. (0) = gr M m 2 
Using (28) and (33) Blei, gE ge 
n 


The ‘mean length of each block’, [= 2 у " 
ё “ЮР л) | S. БР) = н. From ( 
Cnlt) Ө) — [G^ (ур 1 
LHe = 9 = Tm 
[OP MEC 


В. E. Mazes gas 
Hence о? = 6" (1) +60 G0) 
= 1 T 1 
= (reret i) (4-45). (35) 


bo 


Table of n! P(k.n) forl<k<n<l 


| | 
n 
i IZIS 4 | 5 | è 1 8 9 10 11 12 
| 

1 | | 

d ЕЕ әд | 120 | 720| 5,040 | 40,320 | 362,880 3,028,800 | 39,916,800 
4 |1із|п1|50|274| 1,764 13.068 | 109,584 | 1,026,576 | 10,628,640 120,543,840 | 
5! 1 35 | 225 | 1,624 | 13,132 | 118,124 | 1,172,700 | 12,753,576 | 150,917,976 
E | 1| 10| 85| 735| 6,169 | 07,284 723,680 | 8,409,500 | 105,258,076 
& | 1| 15| 175| 1,960 | 22449 | 269,325 | 3,416,930 45,995,730 
7 | 1 21 322 | 4,536 63.273 902,055 | 13,339,535 
$ 1 28 546 | 9,450 157,773 | 2,637,558 
5 a alos E 1 36 870 18,150 357,423 
10 gifs poe s ў : : 1 45 1,320 32,670 

s БЕ : 3 Wu 1 55 1,925 

кыр» eee) > " xo» «ul | à 1 66 
12 | | Р | : р Р 1 


Asymptotic properties, as n > со 
From equation (5) of Jordan, P- 160, we obtain, for fixed k, 
[log n]*7* 


P(k, n) e nk- 1)! . 


k- P(k,n) = 0, uniformly in k. Let 
— шах. P(k,n). 


P(kmax.» п = iml, i 


oximately (Jordan, P- 161) 


Т 
hen, for large т, we have appr 
Imax. > 108% > шк” ў. 


Ву (34) u ~ logn 
Hence 


Thu Р 
8 both шапа l = 9; however, — 
n 


ob. (no. of blocks yielded by 
у ~ Ф(0)- O(a). For a 


f § 8 is discussed. 


ЕЗҮ. 
entral limit theorem de e a wt 
| nd 105% ) 
log n 4 a log p "s iem is 
(1957, pp. 247 


B 
У (35), o2? ~ log n, also. 
а 8 we might expect, а © 
Juste of [n] lies between 
cation of this, see Feller 
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8. 'Two RELATED PROBLEMS CONCERNING RANDOM PERMUTATIONS 


T: i - (1957, 
(а) The following problem is treated іп Riordan (1958, pp. 66-72) and in Feller ( 
pp. 242-3). 


lid А 87” 
‘What is the probability P’(k, n) that a random permutation т ( “и ) үн key d | 
Surprisingly, it turns out that we may derive the equations (10)-(13) for P'(k, n) i 
similar, but more elementary, way and so P'(k,n) = Р(Е, т). 
(b) The second analogous problem may be stated thus: | Tike аа st 
‘Let с (0,...0,) be a random permutation of the integers 1, ...,т. Та 


é , here 
; “block”, W 
“block” (тү... ту), where б), y^ саме б;(- т); (0; 41::.0;,) as second 
" З 


n В leamation 

93. = шах gj and во on. What is the probability P”(k,n) that this amalga: 
j=0:+1), ...,n 

process yields & blocks in all?’ 


By a simple argument (in which the 


; е 
: А tain th 
position of » is considered) we may ob 
recurrence relations 


1 
P'(1,n) == 
т 


n=l 
and P"(k,n) = 1 У P'(k—1,s (2 << n). 
1 


s=k— 


It is easily verified that H "(z,) = H(z,t), where 


НУ 
т-1 


k= 


Р", т)", andso P"(k,n) = P(k,n). 
1 


This result may altern 


atively be derived as a 
Select an n-collection [n 


llows- 
corollary of the above theory, as fo 


у s 1 which 
12 of ordinates X,, with X,,> ... > X, and weights 1,..› 
satisfy the additional condition: 
9: Хы. Хх, > or < Х.Х, 
according as ax Dc юг G6 шах Pis 
i=l, r i=1, ..., (r-+s) d 
for any (r+s)-permutation P (P1 --- Pra.) of the integers 1, +)”, where 1 67,8 <” 
2<т+в< п. with 
We may use a step-by-step construction to verify that such sets of ordinates exist: D 
the n weights 1, ...,1 select arbitrarily the ordinates X, and X, with X, > X,; Ха that 
satisfy @ (with weights 1,1) and 9. Now choose Аз, > Х. and sufficiently n д 
X, X, and X, Satisfy % (with weights 1, 1, 1) and 2. ата choose success 
Жар Сә. Ж, 


Now consider more closely the complete ama]. 


Жау of [818 
by AZ, of the R.A. [^s T | 6 
The first block will be T у 


Тан | pu 
p Where XL = max XR. But, by 9, Ж, = max жые 
vt , x =1,..,в к жыйы 
Similarly, the second block will be Y t+, where PA j 


= шах Х.;ала во on. т 
а 7=(:+1),...в 7 Я roces f 
e same random Permutation с, the amalgamation P pik,” 
under the correspondence Ti X.. Hence P”(k,n) = 
i 


It is now seen that, for th 


the two problems is identica] 
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ыы H H H H 
he problems (а) and (5) lie primarily in the realm of combinatorial analysis rather than 


probability theory (as perhaps also does the rest of the work). 


Bartholomew, for originating the problem, and Mr 29. v. 


Iam indebted to both Dr D. J. 
d giving helpful advice. 


Lindley, for bringing it to my notice an 
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APPENDIX 


o truth of Bartholomew's conjecture (ne 
Theorem 2 with the probability distributio 


as defined in $1. Thus 


*) for Pu, k) referred to 


We А 
аге now in a position to prove th a takon tob 1 
ken e normal, № 


in 
is 81. First, applying the Corollary to 
seen that P(I, k) = PP(L k), where P(l, k) is 
lig 
Pak) = PUA) = д [58 


ч complete the proof, we prove that the conjectured recurrence relations 
ксі pes Е 
Ари: 5 Ёа i ШЕШ » 
; l гісі | 
(7%) 


^ 1 
Pab р 


hence thereby en: 
g equation by 


(9<1<%, 


Where 
suring their own validity. Using 


^ 
;) as above; 
a for P(L k) 88 zk and sum from k = 1% co. 


yiel 
d the same explicit formul 
ж). Multiply the resultin: 


(**), substi P 
), substitute for P(1,&—7) in ( 


riting n mrs 
Bo = Х PUL 
" lof, ue) | 
this gives Ble) = LR snnt Lact 
444 
Al ^ 
80, by (**), Pe = PEE t 


Fr 
0 
тп the last two equations, We have 


o them. It does notseem 
ren. However, (*) is but 


by 
(32 ^ Я ti 
). Hence Pil, k) = P(L 9) for all ly 5 _(**) are indeed өха® >. „ре giv 
(9 tation it m in $8(5) for P^(1,1) = PU, k) 


hus, while not as simple ав(10)-(19)» j : e! 
ылы to provo (*) directly despite the pero inte p). We have [8 = it PU, B) 
and ОЁ a number of recurrence relations satish Уту), by using the кысы, i 
d in Jordan 1947, У" prunk of the University of Missouri for 
. 8. Andersen (Math. 


yet more may be foun D 
E before bY 
sidered ce relations (10)-(13), 


m grate ul to DT 
of. 18 s been com ту recurren 
little or no overlap 


Хой 
the ; e added by author in pro 
informati 2 e of problem h 1), P: E М 
ation that this typ 8). He derives ( д БАЗ from this. there 18 


M 
usi nd. (1954), 2, 557,8, PP- 209- 


bets, ап entirely different, but less 
еп Andersen's paper and mine- 
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, E 
A TEST OF HOMOGENEITY FOR ORDERED ALTERNATIVES 


By D. J. BARTHOLOMEW 
University College of North Staffordshire 


1. INTRODUCTION AND SUMMARY 


"S 2- and 
In an earlier paper (Bartholomew, 1959) referred to subsequently as I, а d 
F-tests were introduced for use against ordered alternatives. The need for t ші ДЕР?! 
the following way. Suppose that we have а sample of k independent values x ^ 78. 
where 2; is normally distributed with mean т; and standard deviation gi is a chart А 
A test is required of the hypothesis Ту = My = ... = my (Hy) against the at d after the 
M, > Mz >... > т, (Hy). The modified tests, denoted by X? and F, are calcu 4 5 son their 
original values have been ‘reduced’ by an averaging process which depends up 
order and magnitude. . — and 
The distribution of these statistics was completely determined for Ё = 3 an nicis "e 
tables of percentage points were given for X?. The distribution depends upon cer 


: ination for 
babilities, P(I, 5). In the general case, when the o’s are not equal, their determin: 


ivariate integr?^- 
k > 4 wasnot possible because of lack of knowledge about the normal multivariate 
For the special с 


. was 
ase of equal standard deviations, however, a recurrence ean that 
conjectured. Miles (1959) has proved this conjecture and has further жне” it 
РЦ, k) = |511, where Sj, is the Stirling number of the first kind. This result has rier 
desirable to extend the tables of percentage points and to give some further prope 
the distribution. This work occupies $2 of the present paper. 


: PS rdering 
It was pointed out in I that a test was required for use when the direction of the 0 
was unknown; that is, for the alternative 


As: m > my> ... > ть OF My > Mya >... > т. 


| new 
Such a test is provided in $3. It is a two-sided version of X? (or F) and requires 10 


ИЕ 4 k= 
distribution theory. Some extra significance levels have been given for k = 3 and 
to facilitate the use of this version of the test, 


A good approximation to the di 


ined; 
Boome aine 
stribution in the general case, for Æ = 5, has been obt 
this is given in 54. 


2. THE DISTRIBUTION OF 


Х FOR EQUAL WEIGHTS 
2-1. The distribution of X when о, 


= бз =... = 0, is given by 
- 1 к қ © 1) 
Prix? > y} = и | P(X) dx? a, | 
= y 


Pr {x = 0) = 1k. 


A table of Stirling numbers will be fo 


und in Miles’s 
points of X? calculated from (1) are giv 


d 05% 
paper. The 10, 5, 2-5, 1 an 
en in Table 1. 
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Table 1. Percentage points of Xi when the weights are equal 


100P | 
10 5 | | 
A | " S uw ! | Bii 
EN = = | 
3 2.5 р | 
: 2-580 3820 5098 | 6892 $146 | 
4 3-187 4-528 эп | тш | 90% | 
р 3036 | 5049 6-471 8-356 9.784 
6 3-994 5-460 6-928 $865 | 10327 | 
: 4-289 580 | 7300 | 9384 | 10-774 
à 4542 | 6088 7094 | 9.639 11-158 | 
9 4761 6:339 7-901 | 9-946 11480 | 
4056 | 650 | 81 | 10-216 11567. | 
1 5130 | өле | 996 10-458 12098 | 
12 5-288 | 6-937 | 8-561 | 10-676 12-257 


able for certain sets of un- 


ribution (1) is also applic: 
on the (2- 2) parameters 


al distribution depends 


| (i218 ss E72) 


Itisofi . 
nba. of interest to point out that the dist 
l os. It can be shown that the gener 


T - | Uliye 
бына (ауаға) (аға + tie) 


wher 
ere а; = 1/03. These parameters are all equal to one-half when d, = @ =... = а, but 
f a's for which this is so. The reason for this is that, for 
su, E71). 


iti 

any wu ба to find other sets 0 

Тһе Pew d № there are (k — 2) equations to deter: 
ecau à ribution would therefore be described m 
lon es of the much greater practical importance 

ed in I has been retained. 


2.9 x . 
ет Properties of the distributi 
ев of the distribution are 1108 


* pi, k) (1-2 
és) ed by 


(e+ I et- 
a k! 


mine (k— 1)ratios, ааа (@ = 1 2, 
ore correctly as that for equal p's. However, 
eof the case @ = da =. = a, the descrip- 


he moments and limiting pro- 


weights. T 
aracteristic function 


on of 9 for equal 
proached via the ch 


t conveniently ap 


ity ic? 
(z+k- 1) 


(30))- Hence, for the cumulant generating 


Where ҮШ "T . don 
(1 — 2it)-* (see Miles (1959) equat! 


Unas 
nction we have кі, +2) log, ! 
T = og, (7 wee 
The fi vi) = log. #0 A и 
Tst four cumulants are found to be 
k Kt с 
к= УЛ" gi к] }, 
E (3) 
k PPE, 
E ae ee 
2 
k = саа ву 6j 
"c 05j2-87 4-30) 
Ка ха ы Biom. 46 


21 
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Some numerical values of £, and f, are given in Table 2, from which it will be seen that the 
distribution is extremely skew even when £ is quite large. In the event of having to test 
significance for large k a curve could be fitted by moments. A x2-curve, fitted using the first 
two moments, gives a very good approximation for k = 10. As k — оо 


(k-1 et 
Te+lk~ e+ 


using the Euler-Gauss product limit formula for the T-function. The cumulants of the 
limiting form are thus given by 


p(t) > 


K, = 1.3.5... (2r — 1).log К. 


The distribution ultimately approaches normality, but only very slowly, as shown " 


Table 2. 
Table 2. Exact and limiting values of В, and By for X? (equal weights) 
k 10 20 50 100 103 10% | ще 
| pnm 
Exact 414 | 31 E * г 
8. ( xact 310 | 2.30 1-94 = sa 
'lLimiting | 3.62 278 | 213 E81 | 121 0-60 am 
| | | ae Ж 
Exact 8-88 А [р - udi 
8, | xact 7-39 6-28 5:75 - m 
Limiting 8-07 6-89 5-08 b 4-69 ged | 3% 


2:3. The distribution of F, when the weights are equal, is given by 


ISu f ий v ar | 


Bi 2) Sel LLM —1), 1-1), | 
Pr(F = 0} = 1k, | 


z= 0- DYN -14 (l-1)y, 


D. J. BARTHOLOMEW 331 


e the method, is given below. The following test procedure 
(or Ғ) as if the alternative were m, > ma >... > ть, and 
тү. If the two values of X? so obtained are denoted by 
(727,57). Since Y? can be derived by the 
ded version сап also be obtained in 


А further example, to illustrat 
is suggested. First, calculate Y? 
then as if it were m, > my 42.72 
x and 3°", the test criterion is defined as max 
likelihood ratio technique it follows that the two-si 
this way. 


Example 


The table given below has been cons 


Wynn (1952, 1953) on explosions in coa 
e two-sided tes 


d from data given by Maguire, Pearson & 
] mines involving more than ten men killed. It 
$ and draws attention to а further field of 


illustrates the application of th 
s to be tested is that the explosions occur randomly at à 
currence has either increased or decreased 


application. The null hypothesi 
herate of oc 
te differed from one period to 


constant rate, The alternative is that t 
during the period. If the alternative stated only that the ra 
be used to test the null hyopthesis. The 


another the Poisson index of dispersion would he 
application of 72 to the frequencies is equivalent to calculating this index for the reduced 
Problem. It has to be assumed that the Poisson frequencies are approximately normally 

2. distribution to test the significance 


distribut ; dan aAa T 
ed; th mplied in the use of the X 
bs | ate here. We therefore have to test 


ОЁ the index of dispersion and will be considered adequ 
variables with weights equal to the reciprocals of themean 


the homo, $ 

geneity of a set of normal a Ч 
frequency. fe carrying out the averaging process indicated on the right of the table we 
arrive at the following two reduced forms. 


tructe 


X “01081008 
y-sided 52-1681 to data on the number of exp 
seen killed (1876-1945) 


Tabl сай 
e3. Application of Pre than ten men 


in coal mines involving 
а Р ing trend 
Danica | N » s Decreasing trend | Increasing tren: 
explosi gm 
__ менен m | 
1876-85 | 81 | 20 | 
1886-95 | 2 шү a | 
1896-05 | 1 ) 10-5 | 
1906-15 | 14 | na | 
1916-25 | 5 } 100 ] 11.3 145 | 
1926-35 | 15 H | | 
1936-45 | 14 pum om à 
BEN —  "Moan frequency = 1543. 
р 11 
1 92 
(2) Decreasing trend. Average: a 1 5 
Weight x 15-43: as 
: 132 № 
(0) Increasing trend. Average: | р 
І Weight х 15:43: p larger than from (b) so we have 
tig jill be mue 
Clear that x? calculated from (a) v 12-705 4373/1543 
xe") = (rn 2280 п) 
К 21-2 


Biz ^ 24-85 
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: . ; soni nt 
Tt will be clear from the following section, and Table 1, that this value is highly — 
and therefore that the data provide strong evidence for a decreasing trend. This a “ne 
is in agreement with those reached using other tests (Maguire et al. (1953), Barnard (1% 4 
x2! к, the 
3-2. The distribution of the two-sided test. Let E, denote the event X" 2 Y and Ёз 
event X" > y. The probability integral of the two-sided test is given by 


P(E, +E) = P(E,) + P(E) - P(E, Ез). 
By symmetry, P(E,) = Р(Е,) so that if y is the 1002 % point of X°. 
P(E, +E.) = 2x — P(E,E,). 


Tt can be deduced immediately from this that 
в < P(E + E,) < 2a. 


In the case k = 3 it is possible to determine P(E, Е.) exactly, using tables of the pum 
bivariate integral; these caleulations show that it can be neglected by comparison with 29° 
The following heuristic argument indicates that this will be true for all k. from 
If y is large (so that о is small) the events Е, and Е, indicate significant departures 1T s 
the null hypothesis in opposite directions. Consequently, the fact that one has occurt 


т we 
reduces the chance that the other will also have occurred, that is P(E, | Ej) < % TPS 
have 


P(E,E,) = P(E,) P(E, | B,) < о. 


Since о is small, we take 
Pr (max (g^, ^) > у} = ЭР > y] 
TNNT еы сап 
when testing significance, the maximum possible error being a. The existing tables ple 
therefore be used if the significance level is doubled. The value obtained for the exami 


in § 3-1 for Е = 7 Нез well beyond the 1 % point for the two-sided test. 


; of 
NL In order to provide 5 and 1% points for the two-sided test, 2-5 and 0-5 % epit 
Х have been computed. They are given in tables 4 and 5 for = 3 and К = 4. For goni 


i рең umn 
ence. Table 4 also includes the 5 and 1 % levels already given in I and an additional col 
of 10% points. 

Table 4. Percentage points of x 
100P | | 
10 5 : " 
— Рз | 25 1 | 0:5 | 
| | 
5 АРИБИНЕ: не 1 | | _ 
t. 2 | a | | 
2.05 | 

0-1 зав | m | see 7389 | 86% | 
| 02 | 216 +108 5-459 7-208 8543 | 
| 03 | 2810 RES | 5378 | 7-122 8:455 | 

2-742 4-001 5-292 7 8:360 

| 04 2-664 3914 | © 1-030 
0-5 9.580 | 5-200 6-932 8:258 
|5 3-820 5-098 6-822 8-146 
| 

a ы zug 4-985 6-700 8:016 

08 dm Feld 4-852 6-556 7-865 

09 | 2080 | а 4-689 6-377 7-677 

1-0 1-642 2-706 205 6-130 7413 

3-841 5413 | 6695 
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Table 5. The 2:5 


%( кемі figure) o ата 0- 5% 


m Уз ра | | 
EON 0-0 0-1 0-2 03 0-4 0-5 | 0:5 0-7 | 
| | 
| —— ки (Жеке 
| | | | 5 
0-0 6861 = РЕН = = E | = P^ | 
10-171 = == Е | - 
01 | 6795 6:724 РЕ єч с | 20b | Es ре | 
10-098 10-021 an КӨШ | | | 
0-2 а | | | 
6-649 = { | | 
9-939 
"S 6-653 6-571 
| 9-852 | 
n 6-486 = = M 
i 9-758 — =н 
95 6-491 5:891 m = 
9-761 9-092 — ый 
os 6-398 5702 um | с=з | 
9.656 8:877 -604 = 
0-7 5.443 5-100 4:346 
8:581 8:183 | 7:978 
0-8 4.999 3841 | — 
| | 8064 6-635 = 
= | 
0:9 = | == 
| = | T z 
IE dies = E 
0 _ "ia |. = 
aê => | | 
> a SS at 
4, THE pISTRIBUTION ков k=? — 
| is not possible to test signifi 
4 ainus one-half it is 00 
9 SNE ее relations 219 jac that the distribution for equal weights 
sien exaetly for large k. Existing results sugges i divergence of the p’s from minus 
ау be a g nless there is 9 79 > еее iiL. 
good ap roximation 0 й by pooling as 8188 ested in 
ар, Alter iere the number 0 алы 22% с ден ven of the existing theory. 
Fortunately а — many practica al problems all wit ds Ts Dd pad or—extreme, 
А á air 8 
man еті) did of the kind— —very good шн, s в five or SIX categories. It is 
oderat jonally ru? RUNS e give a good 
en jed-—0008810 * 1 what follows weg d 
therefore = S "to obtain some e result these cases AA method might yield useful 
aPproximati E api E; and sugg hat ее f th exact distribution lies in 
тезше ийни Жр баса iy in the de mination 0 $ values which involves 
/ тем ál 
finding jn k = 6. The only a cu x id of obtaining i ex! е pecus 
5, 1954 ives 4 ; ау! oide! 
Numer} (5,5). ш: | ou arithmette can be av" cho positive quadrant о f he 
bs oe integration. However: PG 5) ist je volume in р 
ula given by the same auth! hor. PO" trix 
оц У ‘elation n? 
T-variate normal distribution n with cor б 
қ 1 Рі 0 
1 P28 
2 
i= Ё P23 1 д 
o 9 pss 
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ives th roxim: formula 
Plackett gives t! ba^ ate for -— 
= T = ее ® А 
Р(5, 5) = pe: [cos-! ( — p,5) cos (— Pza) + т = 


t 
ion i r the presen 
i «imation is very good for : 

: - It is shown below that this approxima gor cu di 
x E a = — P23 = — рад = $, and cannot lead to a large error for = m 
грозе when — p, = — p, i adt 
ot Using Plackett's approximate formula for P(5, 5) and the Bun о function 
tain the following probabilities. They are expressed in terms of sin бе tem sai 

m been extensively tabled in the Table of Arcsin x (1945). Тһе formulae h 
fied by using the notation of partial correlation. 
š қ l 
P(5, 5) = {sin (—p 2) sin! (— p34)}/ (472) қы інін 
к” | — (sin! (руз) + 1-1 ( — pj) + sin! ( — рза)} 0 
mp Paa tein (— pu] (n) ' қ. 
P(4,5) = 1— (sin и Pig) + Sin-! ( Psa) $l ( i - МЕТЕОР 
(sin? (— p23) - sin! (— £234) + Sin? (рз) + sin ( Bs (— раз) (877) 
CN NET TE M t ) 
P(3,5) = }— (sini( 7 P3234) Sin (руз) Е sint (— P3493) + Sin (— osa 8л) 


+ 81-1 (рола) + sin? ( — pasas 
+ {sin ( — 33) +sin- ( — Рэз) + Sin! ( — P34))/ (87) 


— fsin-1 ( — 81-1 ( — р...  sin-! (— p, ) sin! (— 01,55) 4m): 
(sin? (—p,,) (= P333) 23 айне Ер) sin-} ( — лоза) 
P(2,5) = 4—P(4,5), 
5 
=1- P j,5 . 
F8) 1— E Pj s ЕР" 
4-3. А comparison bet 


en 
ween the exact and approximate probability integral м р 20 an 
for the case — Pis 3. In this case the exact value of Р(5, 5) d ance lev? 
iscrepancy has a negligible effect on the signt " t serious: 
even if P(5, 5) is put equal to zero the effect is n 


Table 6. Comparison of the probabilit 


y integral given by the exact and 
approximate formulae for P(5, 5) 


when — Раз = =P = -pz = $ 


Р , 
Exact Plackett' Р(5,5) - 0 
y approximation 
N za аз | 
5:049 0-05000 0-04961 0:04765 
8-356 0-01000 0-00989 0-00933 


Plackett also 


Bives a table of exact values of P(5. 5) 


‘on 0 

) раб P(5,5) = 0. Tabulation ^4 pe 
th three-way entry, although the size 2 

and the exclusion of extreme values of the / 9: 


reduced by making use of Symmetry 
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or example, when k=6 


The method used here would give 8 partial solution for Ё > 5. Е 
We could assume that P(6, 6) was negligible, obtain an approximate value for P(5, 6) and 
ibility has not been explored. 


determine the remaining probabilities exactly. This poss 
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THE MULTIPLE-RECAPTURE CENSUS 
IL ESTIMATION WHEN THERE IS IMMIGRATION OR DEATH 


Bv J. N. DARROCH 


Statistical Laboratory, U: niversity of Manchester 


l. INTRODUCTION are 

1-1. Inaprevious paper (1958), which will be referred to as (I), we discussed the ж 
recapture census when the population is closed both to augmentation from outside а 
departure from inside. These restrictions are now removed. опа] pro- 

Let the experimenter take s samples, as in (1). Also, let 1—¢, be the conditiona iih 
bability that an individual dies (or permanently emigrates) between the th and (0 pn of 
samples given that it is alive at the time of the kth, k = 1,2,...,s—1. Let n, be the E ds 
the population at the time of the first sample and let x, — ny—ı new individuals rise 
(or be born) into the population between the (5-1) and kth samples and be alive = ted 
time of the kth, k = 2, ИЕ Nothing is assumed about the т, — ты and they are tr uw 
ав parameters of the mode], To treat them as random variables would entail assumpti a 
about the manner in which they vary and would complicate rather than simplify 
probability densities. 


ith 
In $2 we shall take $x = lin which case n, is the population size at the time of the am 
sample, ; — 1,2,...,9. In $3, 7; = n and the size of the population at the time of ун 
sample (i > 1) is a random variable with expected value 1$,...Q;.,. The genera’ ™ 
when there is both immigration and death will be considered in § 4. 
1-2. The main aims of t; 


of 
€ have been unable to obtain satisfactory 5. ше 
ls, notably that, tagged and untagged anim? а) 
е above-mentioned tests on the values of the P 
We hope to fill this gap at a later date. ді” 
rature and a list of references in (I) but, unfor ций 
е, Chitty & Chitty (1953), the third of three же to 
tain severa] ingenious mathematical аррто8о' 


рша" 
vith a very full discussion of field data on por ісе: 


та” 
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en account of in the following pages but we note here that 
served classes ‘caught in the à, ..., samples but not 
e for those animals which are caught in the same 
Also, an additional parameter is introduced to 
s killed or removed at any sample. The 
but, apart from this, there are no 


This eventuality has not been tak 
itis easy to do so. Each of the usual ob 
otherwise’ is coupled with an extra on 
samples but killed or removed at the lth. 
represent the probability that a captured animal i 
estimates of the other parameters change slightly 


mplicatior S 
18. 
2. IMMIGRATION BUT NO DEATH 


ke all values from 1 tos while suffices k, l take all values 


2-1. In this section, suffices 7, j ta 
from 2 to s. 

Let р; (= 1 — q;) denote the probabilit 
the ith sample and let a; be the size of the it 
caught before the th sample and аск. the number which are caug 
Sample. Further, let w denote any non-empty subset of the integers 1, 2, 
number caught in every one of the w-samples but not otherwise. Let 


T 
r= Уй = Lut 
w i 


f the population is caught at 
be the number of individuals 
ht before and at the kth 
...,8 and Uy the 


y that any member o: 
h sample. Let ас 


E ugt EXC S 
i<j 

e experiment. 
ffices for most of $2. However, for the 
vea further notation. Namely, 
egardless of 


ght in the whol 


the total number of individuals cau 
as in (I) and su 
renient to ha 


p The notation thus far is the same 

urpose of deriving the density pl fet} it is conv 

| ibe vi jn ever, f the w-samples, г 

let a, be the number of individuals caught іп every one 0 p 

Whether or not they are caught oth 

с е begin by finding pl] for i 

atching a, individuals in the first sample i$ 
(se 


ay, 


erwise. 


в = 3, using а с e argument. The probability of 
-— , © 


hainwis 


zhi у го the first 
Given q а pability of catching 4з in the second, of W hich а, are common. to the firs 
1 the probability ої са 5 “2 


an , 
d а-а, are new ones, 18 (" | кел — 
in the third, of whicl 


Gi hing а 
У ili ching ds 
еп ау, a, ауз, the probability of catching Sm to the seco 
5 rst only, 4237 715 


fir 
st 
h^ and second, @1з— 4193 
ac : 
аа — а + ау; are new 0068 is n 
T -а-а” 12 Jota 


h ау are common tothe 
nd only and of which 


and renaming he terms involved 
а t 
> 3 


Multi ; 
lplying these three expressions t 8 | 
We obtain в ‚м ИЕ (ns - 4-9 i П рид“. (1) 
wb Ma i ss 
plius = gir sia (ng 4з)! nm a У 
1 2 


N 
Ote that, if n; = n. (1) contracts to 


Whi * 
hich is the model forming the basis 
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(1) may also be written " 
m qf1-7« (210503) (P1 P293)" (P1023)? (P1P2P3 
рш.) = (n4 — 822) Uy! Uya! Urg! Urag! 
x HUE. 4 ^: (р»@)'з (popa) 


(ng — 823)! Up! Uag! - 
Ж а qd: plis. 
(n4 —7)! из! — 
Another way of exhibiting the distribution of the и, is by means of their g 
function w[{t,,}] = ЕП #*]. It is soon shown that ' 
w 


У, = (919293 + 219436 + Pr Podatre +P1G2Patis + P1PoPsties 
+ 912293 + Q1 Da Dslo + d1 19315)" 
X (dada +22431 + DaDalos + qo P3tg) "1" 


(2) 
X (qa + Pats)". 
2-2. Let us define а.) = 0 and 07543 = 7. Then for general s, (1) becomes " 
1 pa ааа ( 
pv] II бь—а<) “а-а, 


p 
Iw! (в-а)! 
w 


Maximizing with respect to p; by equating (9/0p;)log p[{u,,}] to zero, (4) 


NP; = а. 


Maximizing with respect to ^; by equating A ni log p [{u,,}] to zero, 


5 (5) 
ааыа 5 


The equation for i = 1 in 


ig 
: Е ly no} 

(4) is the same as that in (5) and there is consequently 
formation on n, and Py 


* t pat 
: е 1n 
separately. Otherwise, combining (4) and (5) and remembering 
асрар = Ack td, а, р We obtain 


6 
Pr = СЕТИХ ( ; 
ck 
and ñy = Penh, 
a 
provided az, р > 0. sg ii 
H t + . a 
2-3. Having found maximum likelihood estimates from the density (3) We e 
apply maximum likelihood large sample theory to them directly, for reasons 8 о 
(1) $3:2. The principal obstacle is the Presence of the n, in the factorial terms of (3). pat i% 
over this difficulty, we use a suitabl 


we apply maximum likelihood lar 
some of the sample variables and 
this case, the most suitable condition 

The conditional distribution of a 
when the distribution of a 


] o? 


\ inn 
«ial. . i 
тш В including 
<i” Ge; given a, is B[n; — a, pi] (incl 


т-а, 
Ра} = П lo ecd Ріната, e 
1 
f. - 
Hence plv, | [mE Пи I а-а-а П рй<к.крй<к-@<к.к. 
k 
w 
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Reverting for the moment to the parent density, we see that 

А thy = (а.а <k) D + < (8) 

W: m T > . " К 

k here фу is given by (6). This equation will prove useful for two reasons. First, а, and 

үре are held constant іп the conditional density (7), and secondly, фу is the maximum 
elihood estimate of p, for the conditional as well as the parent density. In view of (8), 


it turns out to be preferable to change from py to 0, = pi. (8) then becomes 
fp = (aiia 7 < 

Let L = БО, = онр) асы Th 
oL ае y Gk Фе, 


5. 0 ^ 
provided ала>9 Further 


) e+ tcr (9) 


whi в 
hich, when equated to zero, gives 0, = à zy] ek 


oL mmm 
ҒА («4| = 90,1) 


eu Я 
аен =0 (b+), 
k ; 
Yielding the asymptotic formulae 80,1) 
А _ Өң 
(6-04 mile ee c 
800.00) (8,— 00 | КУЫ 0. Р | 
_ When ас». = 0, the Taylor expansion of 0L [00, at 9, about its value at 0, is meaningless 
Since there is 8 fnite solution for д». However the resulting invalidity of the last formulae 
i > 
8 easily avoided by changing from 0 to 
‚бек. 
ET ak. t г 
+, variance differing only negligibly from that 
d has an asymptotic v2" {зл ing the é technique.) 


Sing ye 
© 01, is always finite an 
k Over the finite part 0 


f the latter's range. acp nis Blac py). Therefore 


9reover, бі, has a negligible pias. Fors | (1-9 ұн) 
Ai Е(б | {а} = Мб prm РЁ aede 
t А p jon апа Vk * ; 1 
ООО Т с Mg i а 
distributed ергена: р his fo ows from the indepe™ on О инда, we eg a 
Property ее given {аа} 1 - ‚ Secondly using - 
hich is proved WI PO 09 [a2] 


y 0, | ia ss 
zi. 69 Hoan 000.10 0|]: 
=—a 3 ас 
<k 


we replace fi, by 


88 
<k > оо. ecalling (9), 
eturning now to the estimation of the Пк and т 65. 
п, = (ace 9" Ke "m 
th, (dex 1)@+1)-1 | 
ati iL s = 1 
8, by та аак? å (пъ 21) Prl- (11) 
Now ) (б: о) xs 
—üzk 
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Therefore, taking expectations conditional on {асу}, 


қ — (n4 — à 24) Pe] 
E((ni —4) | {а} = — (021.4 а) 09981 Oda. — Ч) — Q0 Ак) Ph 
п Ne 3 


Now taking expectations over a... .... ат, Чуть ens куча» 


а 1 
Е[(т.—т) |а] = = (п.- а.) рО 
= = (my) gE, 
since аар. lcr = Вт, — Az), рь]. We shall neglect this bias. 
Squaring (11) and taking expectations. 

(а-а) | az,] = а, 4.) pit (Me = 824) Prq] BU (Oe — 0, | ai] 
%20,т,-а-,)р,4,Е(0,-0,)|а./ (3) 
+0 (т. — 024) Pe Us -— 

, 2 БЕГ AC ^ ss to 
Tn deriving an asymptotic formula for E[(nj — n3], a convenient limit асың term of the 
ni 7 co such that n;[n; -> с, constant. To evaluate the expectation of the fir күс 
right-hand side of (12) over a<r, consider separately the ranges 0 < a2; < 


WW НЕ sum of in- 
hn, < dex, € nj 4, where h < 1—4,...q,., and hn, is integral. Now а. is brc Sens 
dependent binomial variables B[n4,1—q, ... ea], ..., B[n,.., = т, ры an 


1 ü 
Pla, < hm] < РЕВ, 1-4, deal < Мы] = O| ез). 


where c= ( hp) (8 - e « 
| tribt 
<k+ 1. Therefore, for the range 0 < az, < hn,, the jm parto 
of the first term of (12) is at most O(njninylcm)— O(nic':) = o(1). For the 9 the secon 
the range, we may use the asymptotic formula for the variance of 0). Since 
term on the right-hand side of (12) is o(1), we may thus far say that 


10101 
Also д), can never exceed а 


Вт = E (D 
ck» hn, 


t0, — 1 | 
| = и) Ж. [Ж т„— а. 1) Did] +01). 
a, Wek а-а>0 5 
&<r = Ва] = т; (1—4... Чь-1) +... + (nj. ,— Nyo) 


Let 


Dy at 
| | dth 
Then, expanding 1/a.., about l/a., апа taking expectations over а-ы it will be foun | 

Ж (18 
El (nj, —m,)2] = "iti —o..,)q, (1). 
Cp. 
Note that, when all ni = n, (13) becomes жа 
Е (nj. - ny] = i s > ES w =J о 1), 
: TES (hy. daa) p, 
which was derived previously in (I), 4,48 
Р А Е 3) ane: 
Thus, the n; are almost unbiased estimates of the n, with variances given by (13) "m 
is easily shown, with negligible covariances, Further, ni, is the asymptotically most ef 
estimate of the class of estimates 
nj MERO TN 


for which 
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egligible. This is a simple consequence of the fact 
(0,,— 1) is the minimum attainable value 
ed from ‘similar’ tagging and that, if 


and for which Ё|(0# —0;) |а. 1] 18 n 
that, since 0; is asymptotically efficient for б, Ө! 
of À,. In (I) we pointed out that nj may be evaluat 
n; = n, the ni, contain between them all of the information on 2. 


2-4. Suppose that catchability is constant throughout the experiment and that e; 


units of effort are expended in catching the ith sample. Then 

qi =e (14) 

where the unknown coefficient of e; is written as à reciprocal parameter for much the same 

reason that, in 52-3, we changed from py to 0; = pi. In previous effort models, it has been 

customary to make the expected catch size proportional to the effort. This is equivalent 
/y), and the equivalent approxi- 


here to writi :ch involves a relative error Ole; 
ing р, = e,/y which involves а i 
ded = eem is obviously too severe. Therefore, accepting d; — 1—e,/y ав 


4), that is, neglecting terms which are relatively (6/9), 
най for еч is p, = e — e;[2y). When it is expedient to 
he fact that they lead to p; t d; = 1 - 61/29? is not, 


to neglect terms olly’). | 
(а-а which gives rise t 


mation for q; is q; 
а satisfactory approximation to (1 
the approximation of the same or! 
do so, we shall use these expressions. T 
9f course, inconsistent with our decision 
Again, consider the conditional density Р) | 
e? = JI (1 е1) 
k 


o the likelihood 


Са алына 


2 lf tence S a 
Hence > = diee Хаа 
Equating this to zero, the equation for y is 
аскек ---Уйе<һкФ 
> 5 25 я 
Зе а-е 7) * 
m that 
and replacing (1- 6/27) by ! +129, We find th Я 
Я " 


gy DLE 
: — the whole experiment since it has not 
Thi А = 0 (and so 
his formula breaks down if X @<r-¥ ( "m 
k d of 5: conside 


Yieldeq a single recapture). Therefore, instea 


is almost unbiased, for the 


fol а Е i nn 
lowing reason. If all e; = © and p; = ? p 
Sider ы 
тұл дети 1% 
y = yat 
К те independent. Hence 
dekh 
Ы p > р] since the | 
а, = ®В[ас P] = B[Xa«r: f Mos un 
piy Has 7 "р 
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Neglecting g=*<+1 and replacing р by (Ely) (1—@/2y), 
Ely’ {аа} = У. 


In the general case when the e, are unequal, it still seems reasonable to assume that y’ is 
almost unbiased. 


To obtain E[(y' — y)? | (а. е may invert 


Ая оны] = еар, 
=r Ха-арь 
on making use of our approximations for p, and dx. The ó-technique shows that 
2 
kPk 


t 
From now on, the argument follows very much the same lines as іп § 2-3. For the paren 
density 


Fi =} faci] = gp HONEA cp). 


%-(4аы-а- ра; 


and 7 has the same value as for the conditional density. We replace it by y’ and ñ; by т; зву. 


d , Р 
efined as Mi = (ааа Це у’) (1. ej") -a..,. 
Hence (n e Design) 1 15) 
94 — "п; а y =y) M^ [(a.,—a2;) — (n; a2) pil- ( 


0<а.,< М, y, where hy, S Yee 


е з is ing expect?" 
tions over {а на), We find that Ir- and Arny is integral, all k, when taking ехр 
Е (т—т,)?] = (ni— a) pt y: (16) 
шы 48 — Xagp, “н ар: +0) 
k 


imates m* which satisfy obvious aru 
at constant catchability and know: m 
by ті represents a total gain as the 


Ba тар рер аа САГИ РИЦ 
т) 1+ (а Acr) Zapi’ 
1 


2 ne 
Mes i x Y? by РЧ. In order to evaluate Fy na he 
First of all, Suppose that p, = p for all р [es Px and about the rate of ішті mi 


А ns 
Suppose that o — hep 18 CO } 
pcdes ^ «к б< n 
W individuals са) P 


first sam 
Wis The formula for p x can be shown to be very ins 
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merely 
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to the i з ; 
A aen assumptions made about the rate of immigration, and we have 
ose which make the formula as simple as possible. It now becomes 


, _ 1+ nr 
® T+[s(s—1)p/2]> 
5 and three values of p. For p < 0:001, the formula 


and is tabulated below for 8 = 
proximation to the previous one. 


p 
в = s(s— 1)/|2(k — 1) provides a very good ap 


в І 
2 3 4 5 

! | 

0-001 9.91 4:96 3-31 249 | 

| 0-01 9-18 44 | 312 236 | 

| 0-1 5:50 3-00 | 217 176 | 

| | 
When we come to estimate the number of immigrants between samples, knowledge of the 
ble gain in information. For, 


esults in à Very considera 
mi) and not only are the two variances 


iance is large and 


effort i 
ort is doubly advantageous and г 
but their covart 


Уа) , 2 ^ 
м m тур) = var (mg) Var (mg) - 200% (mg, m 
ie ler than their counterparts var (nj) and var (nia) 
sitive whereas cov (nj, n1) is neglitible. ТЇ fact, 
2 
(22:43 <i) (2543 7 Y 400) (+j) (17) 
cov (mj, mj) = е Заара 


ons as We did for Fy, We find that 


an " 
d, making the same simplificati 


g yra =! le 
h = var (mj Mra) 3pl^7 


5 and three 


values of p. 


0 


^18 tabulated below for $ — 


| 5 
| 
Nm m 
293 | 
751 30-2 
7 
| 2 | 892 | 
| 


" T pe ye re model used 
25. We mentioned in (I) that the варише-тесар o ringing of 
E dealing with a large accumulation к. е 
ч - It is appropriate here to show ates: 
1 : stu : 
Si ey effectively made um A Jiki elihoods fort 
Y as the product of the individua 


Jihood could be taken 


out. М 
verall like 
irds; and that for any 
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А ample in which 
likelihood need only record its behaviour onwards from the samy 

р | : i i is wav is 

such 4. For s = 3, the likelihood arrived at in this w ay 

it was fir: A 


(18) 
May Ds, 

(рата (фра (Pi qo pa)" (py po pa)" ( Paqa)”: ( papa): pi 

which may also be written pi pé е Әр, 


фіта, 
or, for general s, П pegs 
The maximum likelihood estimate of р; isa 


ЗЕ: tain an 
а-а. Dividing this into а; to obte 
il"«i-l 

estimate of т 


"em imated by 
i We get ас; ү. In other words, the population size at any = тие The 
the number af individuals captured up to that time, which is plainly . with (7) for 
reason why this model is at fault is indicated by comparing (18) with (1') a 
5 — 3. The latter can be written 


! m 
(а-а — 02)! gue pies 
dee! u 3)? (do ра)" аз (p, ЖШ ' ! р 
plu) | (a2, = ede S (4203)^ (р»@)' (qyp3)'is ( р» р» из! gg (19) 


has 
A remark worth making here is that, even if 


ity suc. 
(18) is replaced by a true density $ 
19) and т, is estimated by à А 
(19) ы % = G/B; 
the sampling variance 
also to their covariane 
reasons, we took 


d 
of a; (an 
of ji; is not only attributable to that of р, but also to that of 4; 


‚ other 
: В CAT о ON 2-4 where, for 
€). This remark is of course implicit in $$ 2-3 and 2-4 v , 


Ж = а-а-а а. 


3. DEATH BUT NO IMMIGRATION 


1 
values fro? 
Si, j take all values from 1 to s and suffices k,l all valu ў 
ditional probabilit 


; the ? 
y that an individual is captured кч caug? 
alive. Let a., be defined as the number of individuals 
corresponding definition for a, . р. 


Ы k fter the jth 
probability of an individual not being caught a 
at the time of the 7; 


th sample. Then, for s = 3, 
-ФФ(1- be) + Pide 93d; 

zi $1p,— $i Фәр, 
Xe = 1-¢,4 ads, 


3-1. Inthis Section, suffice 
l to 8—1. p, is now the con 
sample, given that it is then 
after the kth sample, with a 
Let y, be the conditional 
sample given that it is alive 


№ =1-¢,4 


and To FT, m, = i, 
which has an obvious probability interpretation, 
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ті ili 
he probability density of the и, is 


n! 
pu] = (n—7)! Пи 


Wher 
е, for s = 
fors = 3, P, = т, D. 
Py 
Р, = d?» Ps 
Hence P = PT Pros 
n! 
plun] See piny >n? 


П и ial 


w 


(n —r)! 


32. D 
- Define a.) = rand 5 


4 = n! п Пт 
д = П то П і 
ш 


chan 2 s n, the original parameters were 
ged to лу, {л} and {Px} subject to the ? 


mti = l, 
t 


8 therefore 


ers i 
metion of 


ті 
-ne effe е 
ctive number of paramet 


lis is * 
that, if p[(u,)] is written as à 
refore rea y 


а>у-й>зї{>*РЇ 


= 0. Then, for general 


ai- “>i П р> 
Е 


{дь} and {pi}, ® total 0 
onstraints 


and 702717 ma = 
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п" I Ри» 
w 


! 
w^ 
= 41427: 
= pides: 
= Q PT3 
= Pı PT3: 
зарфха 19:210821 ter 
values of 8, 


(20) 


p-ak. >k, 


f 2s— 1. We have now 


0. 


ction of one. The reason for 


23—2, а redu 
{дь} and (pi da and p, only appear in 
Тп other words, they 


лес : 
are ee ФР and are the 
-identifiable. j 
oximizing (20) with Lagrangian multipliers Ay Аз for the two constraints, We find that 
о 
л; = 0, № = f, 
Sra (21) 
PE >k 
йй = 021—1 бый 
Le | 
t us define Ф, = 1 апа otherwise 
th p; = 1027 giv А 
| ]e. Then 
e Probability of survival up tO the time f the ith samp 
Tis. 
Фр = my + inen +T) 
With А | 
Obvious probability interpretation: Therefore od 
= ‚+ (met are | : 
N nO, = mary] Pk p is th expected population size at the time 
jch 18 © 
of the let ii, denote the estimate of n9 uem estimate? NS 
kth sample. Then substituting he à init " 
й„ = ЕИ >k 
k 
Biom. 46 


anq 
he; 
Nee, substituting for Pe 


22 


fk ay 


346 Multiple-recapture census. 11 


3-3. At this stage, the argument becomes so near! 
we can omit almost all of it. The startin 
and (8). Next, n—r and faz; 
т, пу and {7}. Therefore, 

, T 
Plasi} = ( 


- па" П л >ї-1-@>{, 
1 —r) П (21-а)! i 
t 


y parallel to that of §§ 2-3 and 2-4 that 
g point is the comparison of (21) and (22) with (6) 
—45,) are distributed multinomially with parameters 


1 а —_ак ч 
Непсе Рин} {а = Trz 3 II (@>i1—as,)! При > да-аа. i, 
Паш k 
which should be compared with (7). 
Tet. ny = tD last1) | 


йк syl 
Then nz is almost unbiased for ^O, as т), was for ny. There is, however, an interesting 
additional term in the expressions for var (n7) and cov (яу, тр) which was not present in the 
corresponding formulae for iz, т. We find that 


var (nj) = 0924) пФ, +»Ф(1—Ф,)+0(1) 
0р 

emaining constant, Тһе 

me of the kth sample. In 


Cov (nz, nj) = nO(1— 
and the first term is the covariai 
and lth samples. 


In (I) we remarked that, if all à, = 


as n = co, all other parameters г second term is the variance of the 
actual population size at the ti the same way, 

Ф)+о(1) (k< 1), 

nee of the actua] Population sizes at the times of the kth 


5 of the information 
8 ‘differentiated? ing, 


tagging 
3-4. If q; = е-е? and the % are known, it is stil] true that 
Nn = а-а. 
Непсе АФ, = (а. i17 054)/ 5; + azi. 
Let Fa 2 (4. — 34, р) €. de 


2 а, +1 * 


(Mi), cov (mt, m), i < j, 
: SF = var (п var (my) is 
ms may be hes because, although they йн, the same 
Y comparison. Therefore 
„_ 1+ (а € | 
Pre cate 91) 
ТЕ (asra =. 


80] “> р,” 
The simplifying assum 


ptions are now th 
Taken in conjunction 


at [o 
и, Қ 6%, — 02. is constant for all Ё. 
nsecutiye E ney 1-4, = nD, P, that is the 
: es is const, j the 
Plifications ant and equal to 
Ру = Pre (Ж = 1,9 ы. $—1) 


_ Аў 


Y 


1 
; 
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" ” 
var (ny ,— h 


и) _ (у 


$+2—й (һ = 2,...,8—1). 


ake, бұ var (m; 4— my) 

The experimenter will probably be more interested in estimating the survival probability 
Фу than n®,_,—n®,, the expected number of deaths. Both m;[n; 1 and mj|m; ате 
biased estimates of б, , but the biases are О(п-1) and can easily be evaluated, if desired, 
using the 6-technique. The d-technique also shows that, neglecting terms O(n-1), 


var (nijn) И 
var(my[m; ,)  var(m;— Q, тд 4) 


and, provided д, , is not too much less than one, the latter ratio is well represented by G7. 
3-5. Throughout (I) and so far in this paper, we have been exclusively occupied with 
estimation and have said nothing about testing hypotheses. In this subsection, the likeli- 
hood-ratio method will be adapted for the purpose of testing hypotheses about the values 
of {фи} and (pj. 
Consider any density of the form 
n! 


Ри, = n» ТИП EY П Ру, 


“Шы!” 


Where Q and the P, are functions of parameters {0} say. Let the log-likelihood of n and (0) 
be L, = log p[{u,,}]. Next, consider the conditional density 
II Ру 


w 


ptu) |7] = Tad a-9y* 


and let the corresponding log-likelihood of (0) be L, = log ра.) |7]: 
Let Z, and Г, denote the maximum values of L, and 1». We show first that, to а good 
approximation, Ĉ differs from Ё, only by an additive constant. Equating 0L,/20, to zero, 


т-ғ00 2i; ОР i 
-— T — — = 0. 23 
0 $0, = P, 20, (3) 
Equating A, L, to zero, logn—log (n—r)+logę = 0. (24) 


(24) gives (n—r)/Q = r|(1— 9) and, substituting in (23), we have 


But this is seen to be the equation 0L,/00, = 0. Therefore, the maximum likelihood estimates 
Q and P. of апа P, are the same for L, and Г; and for L4, % = r/(1—Q). To find Ё, and 1,, 
ап approximation for п! is required, and we shall assume that n! = Kean», (This approxi- 
Mation is equivalent to our practice of maximizing L, with respect to % by equating A, L, 
to zero rather than dL, /en. For, if the exact identity A, log т! = logn is replaced by the 
арртохішаде one ?logn!/óm =logn and the latter is integrated, it leads directly to 
n! = Kenya, Using this approximation instead of Stirling’s, n! = /(277) ew, results in 


22-2 
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қ : ; d 
i ikeli i fference which can be ignore 
i um likelihood estimate of n, a di 
an increase of 4 in the maxim 
as it is O(n—).) Hence, 


Г. -ЯПовй-1|-(й--”) [og (#—r)— 1]-log П ty! + (—r)log Q + X 9,106 Р, 
1- ы w 


= r[logr— 1]—log П ty! —rlog (1—Q) + D u,, log P,, 
= L,—logK, 
which proves our assertion. 


Let H, denote a hypothesis which reduces the number of degrees of freedom of (0j) by 


v and Н the alternative hypothesis allowing 70) full freedom Then, as L, has the properties 


required for maximum-likelihood large sample theory, we can say, using self-explanatory 
notation, that 


2 =- 2L, m — Do п] 


is approximately distributed as X? More precisely (Bartlett, 1953), 


b 
Pla € z, « blr] -Í Лодау--О(ғ-) as ғ о. 
Let a= в, —Ly 7). 


Then, since Г, y = L, н-К and, similarly, и, = D, m К, 


T1 = ty. 
Therefore, if Р. is the probability of r individuals, 
Р[а<а <b]= y F PIS < x, < b |r]+ X РРа< v, < b|r], 
T<hn 


r>hn 


where 0 < № < 1—Q such that hn is integral. Now 


У F, Pla <a, < Ы 


I< У Же O(n-ter) = o(n-1), 
rEhn $n 
1-0% Q yia 
where c= (59 | (ғ) <1. 
b 
а ачаа х "| / 0538-001) 
b 
= =o) f Л) d + О (тру, 

Hence, 


Pla xa, <b] = [foa cog, 


which is in the same form as the usual likelihood-ratio test 

There are two applications of this test for whic | 
In each сазе H denotes the hypothesis allowin: 
and {ру} and it is therefore 


a easy to find d РА А 
Lir, can be found from (I) and v = (2s— 2) 


accepted it will of course not mean t 


h the algebra has alre 
5 complete free 
5 the first exam 


ady been performed. 
dom to the parameters (Ф) 

ple, consider Hy: ф„ = 1 all k- 
728—2, (If the test dictates that H, be 
hat we believe it 


to be exactly true since this 18 


ex 
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strictly impossible for an animal population. It will mean, rather, that the фу cannot be 
regarded as reliable estimates of the ¢,.) As the second example, consider Hy: q; = е-%? 
where the e; are known. This will test whether or not catchability is constant. L, g, can 
be found from $ 3-4 and, again, v = (28-2)-8 = s—2. 

Finally, a remark concerning hypothesis testing when there is immigration but no death. 
The above adaptation of the likelihood-ratio method is not applicable to the density (3). 
Тһеге is, however, a way round this difficulty. We can set up a formal model for immigration 
but no death which is the exact inverse of the model for death but no immigration. Namely, 
let the population be of size at the time of the last sample and let 1— у. be the con- 
ditional probability that any individual immigrated between the (k— 1)th and kth sampies 
given that it was in the population at the time of the kth, Æ = 2, 3, ..., 5. This model yields 
just the same estimates as (3) and any likelihood-ratio test can be written down automatio- 
ally from the corresponding one for death but no immigration. (For this model Ри, is 
most quickly arrived at directly as a multinomial probability, but it can also be developed 
as a chain of conditional probabilities P[S;] PUS; | 51]... PES; | Sy, .....S, ,] just as easily as 
for any other model. The main snag about this description of immigration is that it cannot 
be combined with stochastic death. To include both immigration and death, densities (3) 
and (20) must be ‘combined’, and we do this in the next and final section.) 


4. IMMIGRATION AND DEATH 


41. Fors = 3, the generating function Z[T] tw] of p[{u,,}] is clearly (cf. (2)) 
б 


(фа + ра + Рф Роза + Р @1Фз P2Pstia + Pi Pi Pa ФЗ аз 
-- di Фр Хо + dy Oi Da Do Palos + di O1 do Po Palo) 
X (qo Xa + PoXzl2 P2P2Pates + JoPo Pata)" ™ (45 + рз)", (25) 


А р 1 шіп (25 7 
Where y, and y, have been defined in $ 3-1. Тһе coefficient of П tj in (25) turns out to be a 
double sum of probabilities which only contracts to à single probability if», = n, or $, = 1 
and if n, = ng or dy = 1. 


: i i robabilities which cannot be written 
In general, p[(u,,]] is an (s— 1)-dimensional sum of proba te 


as a single expression unless, in every interval between successive samples, there is either 


no immigration or no death. 

4-2. The unwieldy form of p[f{z,,}] clearly rules out estimation by maximizing the like- 
lihood, but there is an obvious pointer to an alternative method. In all cases when there is 
no immigration or no death, the method of maximum likelihood is equivalent to equating 
Some of the class sizes to their expected values. We recall from (1) that, for a closed popula- 
tion, these are (а) and r. It is easily shown that, for immigration but ри death, they are 
{a,}, а.ҙ,...а.,апа г, and, for death but no immigration, they are {а}, r and а. 1,..., аә. 
Therefore, when there is immigration and death, it seems eminently reasonable to equate 


(aj. Qoa s ug 7, Opes s-2 (26) 


to their expected values. We shall indicate how to solve the resulting equations by sup- 
Posing that there are four samples, this being the minimum number from which the general 


Solution may be inferred. 
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For s = 4, the aforesaid equations can be arranged as 


бал = NPr pue 

а, = [1,91 - 15 n] Do, = қ 

dg = [1 612+ (n4 — M) фтп] ps, - c) 

а, = [19105 5 + (n4 n4) Pops + (Ng — Nne) Pa + ng] p, (27d) 
T—051 = тру, (28а) 
0,1705. = [nmp +n n] pa xs, (28b) 
553—053 = [m $163 - (n —n4) 3 - 3 — no] Pa Xa (28c) 
б<в = pi, (29а) 
@<з—а<» = [19,4 4-15 — n] po, (290) 
@<а—@<3 = m $15914» + (n5 n) $24 + i — n5] ps, (29 с) 


where (27а) reappears, for convenience, as (29a). 


Combining (27 a,b,c) with (28a, 0, c), we have y, = ( 


@>к-1—@>ь)[а Orl — y, = a, „ь[й› 
k = 1,2, 3. There is а recurrence relationship between 


successive y, which may be written 


idea — Аа) +Фьрыд = l-X& (k= 1,2), 


Hence $1222 фр, = зым, (30a) 
2 1 

фа E= фр, = аа, (300) 
3 а 

1f we multiply (29а) by фур», add it to (290) and refer to (270), we obtain 
cs — Weg - Pi paa. = а, 
a Ф94а- LEE (31а) 
Similarly, multiplying (294) by Фуф»ру, (295) by ф,р, and adding them to (29c), 
Papia. —a..,) +. opua, = (s a. (315) 


Equations (30) and ( 


; Ds and фу, do, Hence, from (27 5, c), we 
can estimate т; ф, +, 


) 9+ т Na, the expected population 
: Also тъ т, the number of immigrants 
88 not proved to be of any use. 

>Ps and hence for the 


of any method of estimation. 

Since the distribution of the variables (26) is simply a combination of independent multi- 
nomial distributions, it is theoretically straightforward to find formulae for the variances 
and covariances of all the estima 


| tes by using the 9-technique, However, it would be $00 
tedious to compile these formulae for the Seneral case and we shall not attempt it here. 


me 
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453. Ша; = eY and the e; are known, we can again appeal to moment equations for 
the purpose of estimation. In (I) we remarked that, for a closed population, maximizing 
the likelihood is equivalent to equating X a;e;/p; and r to their expected values. When there 

i 


isimmigration but no death the corresponding variables are X a;e;[p;, a... (= 03); esi ciii ng 
i 
and ғ, and when there is death but no immigration they are X a;e;/p;, r and sss ИЕТ 
i 


1 (= а,). With both immigration and death we therefore equate Xa;e;[p;, des, ..., б 
T, @ъу,...,аъ„_у to their expected values, thus providing 2s equations. These enable us to 


estimate all of the 2s unknowns у, {л} and {¢,}; except for s = 2 when the a, e;/p, + ases[p; 
equation is implied by the az, and a, equations and there are only three equations for the 


а 


four unknowns. : 
Without actually performing the straightforward but tedious task of finding the variances 


of the estimates, we can be confident that knowledge of the effort provides very considerable 


gains in information. 


I am indebted to the referee for his most helpful comments. 
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ON AN EXTENSION OF THE CONNEXION BETWEEN 
POISSON AND y? DISTRIBUTIONS 


Ву №. 1. JOHNSON 
University College London 


1. If a random variable x has a (central) ү? distribution with v degrees of freedom then 
the probability density function of x is 


gba ет 


P(x) = Ty (е > 0). 


The probability that x is less than a fixed value X (> 0) is 
X X g-by—1 6-4 da 
Рг{ < X) -f p(x) da: -f T ` 


If vis even, then integrating by parts we finally obtain 


+. 


Pr (s « X) = reip BE ВЕ ur] 


ТИ 


аа ГО» ха 
cr iex 


= РЕ > p}, 


(1) 
where £ is a Poisson variable with expected value 1X.* This relationship between the 
Poisson and ү? distributions has proved of use in Several cases (e.g. table 7 of Pearson & 
Hartley (1954) and particularly Cox (1955)). 

In this paper an interesting extens 2 distribution with 
the distribution of the difference between two independent Poisson variables, is discussed. 
А similar relationship is found between non-central р and the difference ' 


binomial variable and an independent Poisson variable, 


10n, connecting the non-central x 


between a negative 


v 
2. Let a’ = E (u +a}, (2) 
de 
» Sie; № Ug, ..., €, are v independent unit Normal variables and ауа), ..., а, are constants 
mr - — À. Then the probability density function of a^ is 


де) = PUN atte te | 
j=0 J! Ty ұй (x > 0). 


This distribution of а is called the non-centra] y2 distribution 


with v degrees of freedom, 
* £ is used here (and £ 


6 present а random variable. This į tice 

E = . 1 ractice; 
but the connexion between Ё and its mean 2X is similar to that Ded 5 Een | to usual pe its 
mean $A. еп 4 (introduced later 


E 
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and non-centrality parameter À. Tf v is even, then, integrating with respect to x’, and using 
repeatedly the procedure by which (1) was obtained, we find 


а CHA ax/ Ge QAH | " 


Pr(a«X)- P ў (ара) ^ Qvij*0) 


This ean be written © А Б 4 
Pr (a' < Х\= У Pril = ј}Рг{ё > wj} 

350 
where & is, as in $ 1, a Poisson variable with expected value X, and 113 a Poisson variable 


with expected value JA. If we further suppose that 5 and Г are independent, (3) is equi- 
valent to Prix’ < Xj = Pri£-l2 iv). (4) 


Тһе relation (4) is new, so far as the author's knowledge goes, though Patnaik (1949) 


points out that Pr{x’ < X} сап be expressed as “а double Poisson sum’. 

3. If either side of (4) could be approximated it would, of course, provide an approxi- 
mation to the other side. 

It can be shown in straightforward fashion that if k > 0 


ӨККӘКЕТІР ел) ше ? 

қ = R = (XJA ke5 У) в ХА = 
АЛМА 

= (ХА) e-*» IQ: ХАЙ, (5) 


where 1,0) is the Bessel function, with imaginary argument, of order X (cf. Irwin, 1937; 
Skellam, 1946). (For k < 0 the only ‘alteration’ in the formula is the substitution of —k 
for k as the order of the Bessel function. However, in evaluating (4), negative values of 
k cannot arise.) 

With adequate tables the sums of terms such as those on the right-hand side of (5) 
conld clearly be evaluated. However, Patnaik (1949) has shown that a simple (central) y? 
approximation to the non-central y? distribution gives quite good results. 

It appears so far, therefore, that it is doubtful whether (4) helps us approximate non- 
central y? integrals: it seems more likely that the converse process of approximating dis- 
tributions of differences of independent Poisson variables by non-central y? approximations 
will be useful. However, if neither 4X nor 3A is too small, a Normal approximation (expected 
value (X — A), standard deviation ХХ + А)]) to the distribution of ё — 1 should be justi- 
fiable. Using the natural continuity correction this leads to 
co 


Pris < Х} = т e? de, (6) 


Am) 
Formula (6) has been obtained by assuming v is even, but considerations of continuity 
indicate that it will be also a good approximation when и is odd. i 
It is to be expected that the accuracy of (6) will increase with A (and also with X). Pat- 
naik's first non-central ү? approximation, on the other hand, is more accurate for small 
values of A, i.e. when departure from central y? form в not great. Hence one may hope 
the two approximations will be complementary, Patnaik's approximation being used to 
approximate the distribution of the difference of two independent Poisson variables when 
Дің small, and (6) used conversely to approximate to the non-central Л? distribution when 
Ais larger. The relation (4) is thus used to provide approximations for each of the two dis- 


tributions concerned. 


-ю 
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4. The numerical comparisons in this section, while not extensive, аш О. 

i dicate the usefulness of approximations based on (6). In so far as extensiv ё : B 
d become available, especially for greater values of k, exact probabilities, for 
d = Fiete distributions, could be evaluated exactly with some facility. Such tables 
min ipee other applications, e.g. in considering differences between experimental and 
m (background) counts. Such tables may indeed already be in existence and used 
for such purposes. 


Approximate upper and lower 1002 % points for the non-central X? distribution may be 
obtained by solving 


А+ _ | (7) 
V[2(X +A)] Е 
1 © 
for X, where 


25 -h* dx = д. 
Ул) Aa v 0. 


Table 1 shows (i) some exact values of upper and lower 5% points taken from Patnaik’ 
table 2, (ii) the errors in Patnaik’s x? approximation, (iii) the errors involved in determining 
X from (7) above and the results (( 


iv) and (v)) of two other approximations referred to below. 
In our Table 2a direct probabilities calculated from (6) are compared with exact values 
from Patnaik’s table 6. 


8 


Table 1. Upper and lower 5 % points of the non-central X 
(G) Exact values. Errors 
(iv) approximation (7); 


2 distribution 
(approx. — true) in: (ii) Patnaik’s x approximation; (iii) approximation (7); 
3 (v) Pearson and Merrington’s approximation.) 


I 
| 


Upper 5 % ] 


Lower 5% 
»| А 


(i) (ii) Gi) | (iv) (v) 


(i) (ii) (iii) (із) (v) 


864 | —0-01 | 092 | 0-57 | -ooa | 017 003 | x * — 0-09 
4 1464 0:08 | 0-55 | 032 | —0.06 | 0.65 039 | —043 | 0:28 | -012 
16 | 33-06 | 029 | 0-28 | 046 | —0.03 | 632 | 0-57 | 0:25 | _ола | —0.02 
25 | 4531 | 035 | 023 | 013 | боз 1208 | 0-60 | —o21 | ола | 2001 


to 
m 


4 l n7! 0-01 | 0.88 | 0:50 - 0:02 0-91 
4) 1731 | 0-07 | 


0-02 | —0-07 | —0.06 | —0-03 
| 
| 957 | 033 | —0-04 | 1.77 0-18 i 
| —034 | _0. — 0-06 
s = Me | 0-30 | 0-17 | —0-03 | 788 | 048  _ 0290 018 — 0-02 | 
. п „95 H — (0 | . | B | 
| | 0-25 | 0-14 0-02 | 13-73 | 0-58 -017 | 0.11 | —0-01 | 
7| 1 | 16-00 9-01 | 0-83 | 0-47 | - 00 2-49 | 0-09 0-10 0-02 0-00 
е з | is | 059 | 034 | —0-02 | 3.66 012 | —0.07 0-06 | —0-02 
: "9 | 988 | 018 | —0-02 | 1096 | zd 
| | | 10-26 | 0-38 | _o. Emi — 0-01 
| 25 | 51-06 028 | 0-26 | 014 | —0.02 | 1623 | 0-45 и Fe — 0-01 
| | 
* Formulae (7), 


(7) give negative values in this сазе. 


Тһе relative accuracy of approximation (6) and (7), and 
Table 1a), varies with v, A and the limit of 


) Patnaik's X? approximation (in 
д Integration, 
cipated in $3. For small А, Patnaik’s i 


X, roughly, but not quite, as anti- 
er of the two approximations. Аз А 
Over point (of about equal accuracy) 


ib 


e 
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is Bb a considerably lower value of A for the lower 5 % point than for the upper 5 % point 
This is rather unexpected, though the proportionate accuracy of (7) is greater at the u вр 
5% point as would be expected. (The results for v = 7, lower 5 P points, are ы. ды 
anomalous, but the differences vary systematically, giving no indication of incorrect values.) 


Table 2a. Probability that non-central у? exceeds upper 5 % point 
of corresponding central x? distribution 


(i) Exact values (from Patnaik); (ii) approximation (6).) 


Table 20. Comparison 


Patnaik); (ii) Patnail 


((i) Exact values (from 
(6) modified by (6); 


(iv) approximation 


А 2 4 8 12 i 
v | | 
X à | à ооо оо | @ | @ | @ 
| | 
| | 
& | ба | oz | oë | оша | 0718 | OTIS | 0880 | 08178 | 0989 | 0951 
$ | aise | gros | 0:387 | 0:852 | 0685 | WEEL | OBST | 0857 j 0:874. | (00871 
+ | смт | oire | 0320 | +00186 | вов | 0:60L | 9:809 | 0788 | Ogas | qapi 
6 | 0146 | 0152 | 0270 | 0266 | 0-531 | 0-526 | 0-738 | 0-735 | 0-940 | 0-938 
8 | 0131 | 0126 | 0238 | 0235 | 0477 | 0-472 | 0.685 | 0-681 | 0-916 | 0914 
12 | 0:118 | 0-115 | 0198 | 0197 | 0-402 | 0.395 | 0-601 | 0-596 | 0-866 | 0-866 
20 | 0-096 | 0-101 | 0158 | 0.159 | 0-315 | 0-310 | 0-489 | 0-483 | 0-775 | 0:775 


of approximations to the non-central x? probability integral 


k’s Normal approximation; (iii) approximation (6); 
(v) Pearson and Merrington’s approximation.) 


x v А | G) | (ii) | (iii) iie) T 
à | = 
30 16 32 0:0609 0.06388 | 0-0634 0:0632 өзінен 
60 16 32 08316 | 08320 | 08310 0-8308 0-8320 
36 24 24 01567 | 01515 0-1577 0:1577 0-1563 
72 a | 24 | 0.9667 | 09686 0-9644 | 0-0665 ber 
| | | 
5. Patnaik also gives а № ormal approximation, which can be obtained by applying the 
relationship JGx3)- A4(Q»— 1) = unit Normal variable 


to his 4? approximation. This lead 


Pr{a’ < X} = == Г et? da 
~ (Ол) ) 
Р 2X(v4-A) 2(v- A)? 
lanas z= a -A v 2À -1). (8) 


ч А few numerical comparison: 
in Table 90, The two approxima 


s to 


indicated by these four comparisons. 


tions appear to be of 


s of (6) and (8), using values in Patnaik's table 3, are shown 
about equal accuracy, so far as is 
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Тһе limit of the integral in (6) is of the form 


Too 3а (9) 
A4CBX +С)? 
while in (8) the limit is of form D4X —E, (10) 
where 4, В, C, D, E are constants depending on v and A. (9) is equal to zero when 
X=A=A+p-1, 
(10) equals zero when a йок — 


The latter values lies between Ac v—$ and A--v—1; hence the differences between the 
median non-central ү? given by the two approximations must be less than 4 


1, and approxi- 
mation (7) will always give a smaller median value than Patnaik’s Normal approximation. 
The forms of expressions ( 


9) and (10) indicate that good agreement between the two 
approximations cannot be ex 


pected over the whole range of values of X. When X is large. 
for example, the ratio of (10) to (9) is approximately 


2V+A) о) о [vA _ ,, 
p48- (0з) = [+ > р. 


Hence for upper percentage points we would expect (6) ( 
of X considerably larger than those given by Patnaik’ 
the latter should give larger values for Pr {а < 
although the differences in values of Pr 
because the values of 


or equivalently (7)) to give values 
в Normal approximation. Conversely; 
X}. Table 2b indicates that this is the case, 
іш < X} are not remarkably large. This may be 
X involved are not particularly large. 

is exact. The accuracy of approximation ( 
ximation to the distribution of the differe 


- This accuracy should be rat; 
mation to either Poisson distrib 


coefficient of the distribution of 


6. Equation (4) 6) is essentially the accuracy 
nce between two independent 
her greater than that of a Normal approxi- 
ution separately, especially if X + A. The Af, skewness 
E£—lis 
42(X —2) 
(XA 


А natural improvement to the Nor 
term in the Gram— 


1 X-A E. 1 (2-4 2 

от | =з =] " _ —À—r41) 

342 (XAL ЕЛ) Те» E eva “| 
to the right-hand side of (6). The results of adding this со 
of Table 26 are shown in column 


. rrection to the values in column (iii) 
(iv) of this table. 
Another method of Seeking an im 


variable by a multiple of a Х variable such that the two y. 
standard deviation, Е, | are then replaced by іх, Ix where XX. X? denote central Y? 
variables with X, A degrees of freedom respectively, idis 

From (4), we then obtain, using the continuity correction 
Pre! < X) = БОҚ > yyy, (11) 
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In general this result would not be useful as a basis for computation, since the distribution 
of y& — X3 is not simple to handle. If, however, one (preferably both) of the quantities X 
and A are even integers, then the probability density function of уз — y$ can be obtained 
as a simple finite sum. If min (X, A) = 2 there is only one term in the expansion, if 
min (X, A) = 4 there are two terms, and so on. 

Just as approximation (6) was modified by (6)' to allow for the skewness of distribution 
of £—1, we can also modify equation (7) for the percentage points of the non-central у? 
distribution. Using the initial terms in the inverse Cornish-Fisher (1937) expansion we 


obtain 9 (r—1) X2) м, 42 (X +A)? M 
BART X+A+#(u2—1) i (7) 


This is a quartic equation in X, which could be solved by an iterative method. Computation 
is rather simpler, however, if (7)' is used to obtain the value of X — А and hence the values 
of X and A, corresponding to a given value of X +A. From the table of values of X and A 
so obtained the value of X, corresponding to a given А, сап be found by interpolation. 
Column (iv) of Table 1 gives the errors in values of X obtained by this method. 

Columns (v) of Tables 1 and 20 give errors in values of X, and computed probabilities, 
respectively, obtained from a new approximation due to E. S. Pearson and Maxine Mer- 
rington, which is described in a Note at the end of this paper. 


7. We now consider the non-central F distribution. This is defined as the distribution 


of the non-central F-variable 
(non-central x? with v degrees of freedom)/v 
„  (non-e à 
dim (central x? with ry degrees of freedom)/v, 


, 


the numerator and denominator being independent. The denominator will be written, for 
brevity, as v/v). Using (4), the conditional probability that F” does not exceed X |», given the 
A 4 


value of v, i | | 
Би Pr(F' < X[v|vj = Ре < Хој} 


= Prí -l> b) (12) 


ariable, independent of l, with expected value $Xv[v,. 


(if v is even), where Ё, is a Poisson v: | У 
) 2 tions. Averaging (12) over the distribution 


?' and J have the same meaning as in earlier sec 
of », we find the unconditional probability 

Pr{F’ < Xj} = Pr(£ -l> Wh (13) 
Where £' is a random variable, independent of l, with distribution obtained by averaging 
the Poisson distribution of &, over the distribution of v. Thus 


1 ш (4X2/¥%)! хо” јр 
em DEL о 00 2 ео 
Pr 7 7 ы,” SEN 


"mm (X[2v9 р оні e PRU Xb dy 
озю у) Л Jo 


rarer (XV +" оа.) 


Vo, 
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аа. 15 inthe 
that £'is a negative binomial variable with integral probabilities given by terms in th 
so 
expansion of [a 4 X [vy — X [vg] rs. 


Р in 
Equation (12) implies therefore that the distribution of non-central F can be expressed 


қ . . ы . Pu d 
terms of the distribution of the difference between a negative binomial variable (&’) an: 
an independent Poisson variable (1). 


Тһе expected value and variance of £' are 


X z 
ото 80 
X 
and ы ( +3) -4Х ( + ) 
vo Vo, Vo 


respectively. Hence the analogue of approximation (6) is 


н = 1 (Х—А->+1)/у/{9(Х+А+ Хи] EM | (14) 
Pr(F “ХВ + т. et do 
This differs from (6) only in the term X?|w, Table 3 gives approximate values calculated 
using (14) compared with exact values and values of an approximation, using the central 
F distribution, obtained by Patnaik, taken from Patnaik's table 7. Table 3 also gives коше 
further approximations analogous to those obtained for non-central x? by the modification 
(6) of (6). The corrective term to be added in the present case is 
1 X0-2XDgQX[x)-AT(X-A- v4 IE alb И (Х-А-у- r£ |. 
3/2 (ХА Es +А+Х и) | Дел) 5? ахо 


The exact distribution of E 


large | 
— can be expressed in terms of the confluent, hypergeometric 
function —— | 
М(а,с, 1) = 160) x ы) ui (15) 
Г(а) = Г(с T3)! 
We have for k > 0 
, S оа (2А) Гду j) (Хұн V 
Pr’-l=B= ye з (6 0 я X -s-k-i 
| ЕС. Jj Pv) Piin. "c 


T(3») k! Vo 
Direct evaluation of ( 
hypergeometric function 


_ Ty) +h) e ( к ie 


", we can use an um 
А © quadratic equation (in X) derived from с 
һе corresponding equat = vx (significance limit of F » 
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8. In this paper it has been shown that the relationship between central y? and Poisson 
variables can be extended in two ways. The cumulative distribution function of a non- 
central X? can be expressed in the form of the upper tail of the distribution of 

(a) (Poisson variable)-(Poisson variable), provided the number (v) of degrees of freedom 
ік even. 

Also the cumulative distribution function of a non-central F can be expressed as the 
upper tail of the distribution of 

(b) (Negative binomial variable)-(Poisson variable), again provided the number (v) of 
degrees of freedom of the non-central y? in the numerator is even. 


Table 3. Values of the non-central F integral, Рг{Е’ < X] 


((i) Exact values; (ii) Patnaik's central F approximation; 
(iii) approximation (14); (iv) approximation (14) modified by (14)’.) 


d | | | 

v Vo А Xl | (i) | (ii) (ін) | (iv) 
—€— | 

3 10 4 3-708 | 0745 0-752 0-755 0-736 
4 | 6552 | 0-918 0-919 0-889 0-901 
16 3-708 0-206 | 0-203 02319 | 0-213 
16 6-552 0-517 0-520 0-544 0-517 
|* 3 20 4 3-098 0700 0-706 0-711 0-694 
4 4-938 0887 | 0889 0.873 0-879 
16 3-098 0-126 0-119 0-129 0-130 
16 4-938 0-347 0-350 0:364 0-350 
5 10 | 6 3-326 0-731 | 0-731 0-746 0-726 
6 5-636 | 0914 | 0-913 0-886 0-903 
24 3-326 0-158 0-157 0-165 0-165 
г 24 5-636 0-461 0-463 0-504 | 0-461 
5 20 | 6 2-711 0-664 0-665 0-680 | 0-660 
6 4.103 0-870 0-869 0-959 | 0-862 
94 2.711 | 0-069 0-064 0-068 0-072 
24 4-103 0:245 0:244 0-257 0-249 
3-072 0-714 0-715 0-743 0-718 
| м 9 5:057 0-908 0-909 0-882 0-893 
36 3-072 0-119 0-117 0118 | 0195 
36 5:051 0-408 0-409 0451 | 0-409 
г 9 2-226 0-578 0-581 0-596 0-576 
қ Di 9 3-173 0-813 0-815 0-813 0-809 
36 2226 | 0-017 | 0014 0-015 0-017 
| 36 3-173 0.088 | 0085 0-086 0-090 


For completeness 16 is interesting to inquire whether the upper tails of the distributions of 


able)-(Negative binomial variable), | 
1 variable)-(Negative binomial variable) correspond to the 
f any continuous variables of well-known types. 


(c) (Poisson vari 
(d) (Negative binomia 
Cumulative distribution functions о 
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The second variable in (a) or (b) can be changed into а negative binomial variable bd 
replacing the fixed expected value, $A, by a random homm with the 2. ace 
multiple of (central) x?. In the special case when this X? has the same number (v) o өр : 
of freedom as the original non-central x? this would correspond to a componen: of variano 
analysis of variance model (e.g. Johnson, 1948) as opposed to the кердең model repre- 
sented by a fixed A. In this situation the non-central ү? becomes a central ү? with the same 
number (v) of degrees of freedom. 

Hence, if v is even, (c) is then associated with a central ү? distribution with v degrees of 
freedom. This is, however, only a special case of (c) and, even in this case, the original 
formula (1) provides a more useful result. When 1A is a multiple of a central X? with arbitrary 
degrees of freedom (v) there seems to be no simple continuous distribution associated with 

(c). But Ши < v we might regard our modified non-central y2, 27, as arising from (2) in à 
mixed type of model, for which the a,’s are random variables but are not linearly indepen- 
dent. Then (c) would provide (for even v) the discontinuous counterpart. 

Similarly, (d) can be associated, in a special case, with a central /'-distribution with an 


even number (v) of degrees of freedom in the numerator. Again, this is only a special case 
of (d), and the relation 


Pr(F' < Хм = Pr{é' > y 
obtained by putting A = 0 in (b), would be preferable for most purposes. The remarks about 


possible interpretations of the general form of (c) in terms of a mixed form of model apply 
to (d) also. 


9. Finally we repeat the procedure used in obtaining (12), but replacing the central А 
by а non-central x?, with v, degrees of freedom and non-centrality parameter Ag, which we 
will denote by ҳи. 

Since the probability density function of X? ean be written 


2) = Усы, №! 
PO) = Хе С xt: 


it follows that the averaged £, is now a mixture of negative binomial variables having the 
probability generating functions | 


ИХ) Хао Wa 
with probabilities e!» (yj! respectively (j — 0,1,2, ...). Hence if P" = С ]v) is] vo 
represent an F-ratio based on two non-central 32° then, if y is even и 
Pr(F" < Хр = 
j 
where &; is a negative binomial variable ind 


? à ependent of 1, having pr bability generating 
function (17). We note that &; has the same distribution as the : ce ove ata g which 
appears іп (13), with v, replaced by vo4-2j a 4 E INR 


nd X replaced by X(vo+2j)/vọ. Since 
(8) = 4X (1+ 2j], 


var (5) = 2X (1+ Xy (14 94/ 


err 
X chro? Р —1> p}, (18) 


and 


= 
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a Poisson distribution with mean 1A,. The first term in an evaluation of this expected value 
by the method of statistical differentials, for example, would be Pr(£"—1! > 4v}, where 
=" is a Poisson random variable with mean 1A,; usually, of course, further terms would be 
needed. 

10. We had, in the last paragraph of $9, a ‘mixture’ of negative binomial variables (see 
Robbins & Pitman (1949) for formal use of the term * mixture). The concept of a ‘mixture’ 
of variables has a number of other applications in the present context. We can regard the 
non-central y? distribution as a mixture of central X? distributions, the negative binomial 
distribution as a mixture (with continuous weighting) of Poisson distributions and the 
non-central G-ratio distribution as a mixture of central G distributions. Here, central 


2 


G,,,, is defined as the ratio of a central x? to an independent central үр; for non-central G 


e f "^ H : 
the numerator becomes а non-central 2, x;?, say. A reasonable type of approximation to 
investigate is that obtained by using а mixture containing only two of the component 
distributions. This is a natural first extension of single component approximations of the 


type used by Patnaik for non-central X? and non-central F. 
We will find it convenient to use the following lemma (Jones, 1933). 
LEMMA. 112+ (1-6) 25 = k, (r= 1, 2, 3) then тү, гә must be roots of the equation 


(К„— 18) z? — (k3 — ki ka)z + 4-13 = 0. 


This result follows from the equations 


к= kik = с(1—с) (2+ 25) (5 — га)», 
hk, —k$ = c(1—0)2 zy(24 — 20). 
(i) Non-central у? 


If y/? be а non-central x? with v 
NN их) = vA. m) = 20+ 2) + (V +À}, 


degrees of freedom and non-centrality parameter A, 


(Ху) = 8(v + BA) + 6(v + 2A) (> +A) + (v +A}. 


1, Ya degrees of freedom respectively, in the ratio 


If ix 4 tral y?'s with v 
у be a mixture of two central у әйт iter 


с: (1— c), then, equating the first three moments of 
cv, +(1—¢) Pp = v+a, 
ov? +(1—c) 8 = (V+ A)? + 2A, 


od (1—0) = (VEAP + OA +A) + AA. 


Using the Lemma, vı, v, are roots of 
(А+ zt (VEAP + = 0, 


50 yy = VEAF14(2A4 1) 
у. = ›+А+1—/(2А +1), 
апа с= 40 — (224 1%), 


Biom, 
23 m. 46 
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We can approximate similarly to the distribution of ауы +а, 2. We obtain equations 
су + (1—с)ь. = W, 
ся + (1-е) = Way, 
i+ (1-6) = 3+6 У+ 47, 
where W-agwrag Y 


Z 


= ala, — 1)w,  as(a, — 1) we, 
-а(а1-1) (2a, — 1) +а,(а,— 1) (2a5 1) we. 


We find 9», = (Z+ WY eJ: УЗ] у, уз = (Z+ WY- (Z2 - Уз] ] У, 


с = i - ZU уз)-ң, 
1t will usually be convenient to arrange that the variable is ‘standardized’ by making 
аа = 1. 


"This result can be used in approximating to the distribution of modified t, 
y = “(910 + aswa) 
NICA Aun “а, №.) D 
Where v is a unit Normal variable independent of the x? 
connexion with some criteria for testing differences betwee: 
with unknown variances (Welch (1938)). Applying the appr 


Prt! < в} = срь fh, < ts а, V, + d5w;)] 


s. Modified 4 is encountered in 
n means of Normal populations 
oximation of this section we have 


%(1-с)Рг %,<% УГ» (aw; + a, w;)])- (19) 


ilarly be expressed in terms of non-central t 


Then the mixture, in the ratio c: (1—c 


| ), of two Poisson dist 
will have the same first three moments 


ributions with means ті, nts 
as y if 


em, +(1—c)m, = np, 


omit (1c) mg = n(n 4. 1) ра, 
emi + (1— c) m = nin 1 3 
m, and m, are the roots оғ, 2 e )(@+ 2) рз, 
2 —2(n4 1) pz--n(n 4-1) р? — 0, 


8o ™ = PAR (т 1]4-1) 


; "e= PI) +1]— 1), 
asd = al- (4.14) 


h these valu 


It is interesting to note that wit ез of m, 

1 
ста + (1—с) та — тт 1) 
ав compared with the value n(n+1)(n4 2) 
also with the fourth moment of у. 


т and c, we һауе 
(%-++4) gà 


(n+ 3) pt which would give exact agreement 
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(iii) Non-central F 


Two alternative approximations to the non-central F probability 


integral will be put 
forward here. 


The first is based on the direct approximation to the non-central 6 distribution using 
(i) of this section to approximate the non-central Ліп the numerator. This leadsim mediately 


to the approximation to the distribution of non-central бі 2] 
the ratio (1 - (2A + 1)79): (1 (92 4 1)-) 
of the central G distributions of 


Av Ху 


, by the mixture in 


Gum Gs Where и, = у+А+1 + /(2А+1), 
Equivalently, Fc ORI) 
80 that 


Pr(F,, < XI) = M1—(2A4- РИ, и < Хр} 


+31 + (QA 1)H] Pr | P Хр}. (20 
The second approximation is based on (13). Using (ii) of this section we h 
Ва э b) e D - ОНР, 1» b) ede Qua 1)-}] Pr 

where Ea £y are Poisson variables, independent of J, w 

MX, = X Jr 1) (ИЗ + 11+ 1)/ng, 


respectively. Proceeding further we use (4) 
integr 


Ma = VEA S 1— (А+ 7); 


Хо) is approximated by a mixture of "Е оо and Е” 


) 


ауе 
&-1218, (21) 
ith expected values 

EX X (d+ D уор, 


to express (21) in terms of non-central y2 
als. We then use (ii) of this section to obtain àn àpproxim 


ation to the non-eentral И 
integra] in terms of four central y? integrals. This is 
Pr(F,, < XI) = qn — Gn 1)7][p- (234. 1)-%уР 
++#(1+(2А+ 1)3) Pro. < Xy] 
Fall + (vot DAJO — (2A + 1)-4) Pr = 3 
+A + (2+1) Priyè < ху, 
alues defined above, and 
и =ь+А-+1+ A(A 1), v =У+А+1- 


TO «XQ 


Where X. X » have the v 


(2A 4-1). 
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Note on an approximation to the distribution of non-central y? 


Bv E. S. PEARSON 


E(x’) = v+a, a(z’) = J(9(v 4-92), 


(1) 
8(v + 3A) 12(v 4- 42) 
filz’) = D ] 


bey: Ala ty SOS) 
(msc Arle’) = 3+ (РЭА) 
Tt follows that the Ра, Ba point of the distribution must fall within a fairly narrow sector of the Bu Ps 
field lying between the straight lines 


f.—if,-3-20 (when Д/»-> 0), 


В, -3=0 (when v/A - 0), 
the former being the x? or T: i 
differ greatly from that of a Y? di 


v and À are small, it mi 
Standard deviation and f) coeffici 


by using a y? distribution with the correct mean, 
This latter result is achieved b 


actly the right start, 


y taking 
х, р V+3BA A: (3) 
V E жо Ty 9А и, 
е Gy, 906) 60) VFA” iva 
where д2, is distributed as а central y? having fractiona] degrees of freedom given by 
= 08 (фәл) (3) 
v = ——__ = 1 
ДЕЙ) (v 3A)? 
Using equations (2) and (3) and interpolating for v’ in the table of percentage points of y? (Biometrika 
Tables for Statisticians, 1, table 8) we obtained upper and lower 50 Н 
errors as shown in columns (v) of Joh i 


о points for non-central y? with e 
nson's table ], Again, for the » Vand A values given in his table 20, 


ma і r 
> It is clear that for tho particula 
‚ the three-moment № approximation 
ved is Straightforward, apart from t 
fractional degrees of freedom, v’, 
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THE y? TEST FOR SMALL EXPECTATIONS IN CONTINGENCY 
TABLES, WITH SPECIAL REFERENCE TO 
ACCIDENTS AND ABSENTEEISM 


By C. A. G. NASS 
Netherlands Institute of Preventive M. edicine, Leiden 


1. INTRODUCTION 


In the study of absenteeism we are often presented with а. large number of small samples of 
Sickness absences or accidents, specified for two or more subperiods, each sample being 
contributed by one worker, observed during the total period. We desire to know whether 
the absences, say, of individual workers can be regarded as random samples of such a 
population, given by the marginal totals, regardless of whether or not the marginal totals 
agree with the lengths of the subperiods or some other assumption about the distribution 


of absences in time. 
Thus we are presented with an (m x n)-design, with (m — 1) (n—1) degrees of freedom: 


Ші ee ij o т | А 

V ij e Vim Ži 

" wis umo зв oes с (1) 
Va Фа) “ат 2а 

Và o Yi o Ymi N 


The joint distribution with which are are concerned is one that could arise, among other 
Ways, if from a finite population, subdivided into m classes with у, ..., Ym elements, samples 
of z, 292,1 elements are randomly drawn without replacement, the remainder being 
considered as the nth sample with z, elements. In our particular problem the rows of (1) 
represent workers, the columns subperiods and the letters the number of absences or 


accidents of one worker іп one subperiod. | : 
Our test of homogeneity or independence within the contingency table will be based on 
the isti ` ena 
Statistic Q = МУ у М. (2) 
ij 


1)(n—1). (3) 


The standard test is to set 22-06, у= (т 


It is well known that if none of the expectations of 2;; are too small, then under the null 
hypothesis of independence, G will be distributed approximately in a chi-squared dis- 
tribution with a parameter v equal to the number of degrees of freedom. When, however, 
the expectations are small the distribution of G will differ from that of x?, partly because the 
°rmer becomes noticeably discontinuous and partly because the moments of G, beyond 
the first, may be rather different from those of д2. | 

ith absenteeism it often happens that a number of expectations of ?;; are smaller 
than unity, because a number of workers have only one absence in the total period of 
Observation. Haldane (1937) met with the same difficulty in the field of genetics. In his 
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bers of 
the rows of (1) represent litters, the columns coat-colours and the letters num 
case 


+ : H H H of 
ice. He suggested using the normal distribution as an approximation to the distribution 
mice. 
4 
Қ” u = (G—E(G))| var G, (4) 


— E ‘ andard 
where wis assumed to be approximately normally distributed about zero with unit stan ет. 
deviation. However, this approximation must be poor if the coefficient of aie n 
is not small, which occurs in the field of absenteeism as it does in genetics (Grüneberg 
Haldane, 1937). | | Көке” 

Vessereau (1958), іп a paper restricted to the one-way classification (a special ae 
design (1)), considered the approximation of the distribution of cG by a chi-squared 


В " ts 
tribution with parameter v, where c and v are determined such that the first two momen 
of the former distribution are equal to those of the latter, i.e. 


X -cG, c—2E(G)|varG, v = cE(G). (5) 


He also considered the approximation of the distribution of a+G by à chi-squared dis- 
tribution, with suitably chosen а and v. He found that it was markedly worse than in the 
case of (5), judged by the third and fourth moments. 1f 
It might be conceived intuitively that the approximation (5) must be the better one. 
expectations are decreasing and the variance of G has ceased to be nearly twice its expecta- 


tion, the general shape of the distribution of G might continue for a while to resemble that 
of some chi-squared distribution and i 


t might continue to start at only a little above ШЕТ 
because the values of N(y;z;)-! are not integral. Probably a still better approximation coul 
be obtained by setting X? = a+cG, ensuring correct values for the first three moments: 
But the third moment of 


G for the general design (1) is at present unknown and even і 
determined, would probably be too unwieldy for practical use. 


It is true that Vessereau considered the additional computations involved in determining 
the c and v of (5) to be prohibitive. However, the grea 


problem considered. For these reasons we (4 


In the present paper we shall consider met; 


expectations are small, based on Ve 
favoured by its author. In зо far as thi 


9 not support Vessereau’s advice. 

hods of determining the significance of 6, Ms 
Ssereau's approximation (5), although it was no 
€ resulting value of p ig beyond the range of existing 
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tables of the chi-squared distribution, we shall use R. A. Fisher's approximation, namely, 


assume that u = (2x2) -4 (2v —1) (96) — (2v —1) (6) 


is normally distributed about zero with unit standard deviation. Comparisons between the 
true and approximate distribution of G, although limited to a few easily caleulated cases, 
do suggest that the approximation is likely to bea good one in a very wide range of situations 
likely to be met in the analysis of accident and absenteeism data. 


2. ТнЕ AVAILABLE TOOLS 


Тһе general design (1), and some of its special cases, constitute a handy set of tools for the 
study of accidents and absenteeism. For all of these the parameters c and v are obtained in 
the same way (equation (5)), but the appropriate expressions for G, E(G) and var ©, to be 
given in this section, are different. For each of the tools we will give an appreciation of its 


special merits for certain jobs. 


(i) Case of mx n-fold table with (m — 1) (n — 1) degrees of freedom 
This is the general design (1). Equation (2) gives the expression for G. That for E(G) is 


m-1)(n—1)N 
n= ie ш (т) 


After publishing expressions for the moments of G for several special cases, Haldane (1939) 
found an expression for the variance of G for the general design. Dawson (1954) put Hal- 
dane's expression into the following shape, which is more suitable for computation: 


(n —1) CN -т) рм (т-1) (N —m) 


ртр C N-1 ? 
NE) -n (351) –т? 
«ШШШ, тес N-r c e) 
2N N? 
then varG = V-3 (p-e)(—7)* HO 


The merit of this tool is that it tests independence regardless of distribution in time and of 
distribution over workers. In this context ‘independence’ means that the set (pi, ..., Dm) 
of probabilities of subperiods, for an arbitrary worker with 2 ripam in the total period, 
is independent of z and of the worker. ‘Distribution in time’ means the distribution of 
absences over subperiods, regardless of the population of workers. Accordingly, this concept 
presupposes the existence of independence and it should be considered only after the data 
have successfully passed the test for independence, or if there i ier ea reasons to 
take independence for granted. Under this pre-supposition : ies ve in time is 
multinomial and assumptions about the values of ру, ; ‚.› Pm can be tested with the classical 
xM i с і ill usually be large enough. 

i is Na ii PS over ari, and its inverse, the distribution of workers 
Over numbers of absences, lose practical importance if they are not constant over all 
Periods with N absences in total. Analysis of such distributions should also be preceded by 
the test for independence, if the data are specified for subperiods. 
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(4) Case of m x n-fold table with m(n—1) or (m — 1) n degrees of freedon - ко 

These are limiting cases of the preceding one. In fact, if an (m + 1)th so is к, 

the design (1), with a column total Y tending to infinity and cell-frequencies b де pe 
the quotients X;/Y tend to fixed numbers p; then the contribution of column (2 i 


; s : th, in 
vanishes with probability one and (1) tends to a design with random row totals with 
the limit: 


G= Yshpsyj-N, | (9) 
1,7 

E(G) = m(n—l) varG = 2т(п—1)+ (Epi! +2 п —2n) Eyj!. ! 

Тһе equations for the case with (m —1)n degrees of freedom are symmetrical with а 

'Тһезе results were found by Haldane (1937). In these cases independence is tested sin 


z og 2 еее lU 
taneously with an assumed distribution of absences over either subperiods or worker 
is difficult to imagine a situation in the st 


udy of absenteeism where this model would be 
appropriate. 


(11) Case of one-way table 

И we set m = 1 in case (ii) with m(n — 1) degrees of freedom, the design reduces to a dis- 
crete one-way distribution with n cells. Dropping the subscript j, the appropriate equation 

nih G -XGQNp)-N, E(G)-m—1, \ 


varG = 2(n — 1) — (n2 4.24 — 2)/ N  X(N p;)3. | 

These results were found by K. Pearson (1932) 
Usually one or more parame 

To provide for this fact we su 

(n —k — 1)/(% 


(10) 


ters must be fitted to the зал 


ggest that E(G) and var (y. 
— 1), if k parameters are fit 


nple distribution to find the Pr 
found with (10), be multiplied by 
ted. This is the approximate effect; of efficient 
arge. That it would apply also when expectations 
t it seems better than no correction at all, which 
would certainly produce an inflation of the errors of the second kind. 

It is often desirable to reserve the notation Ti, or x for the class values of the variate of 
the distribution tested. In that case the class frequencies might be denoted by n; or n 

Tf the range of the distribution isinfinite, it must be curtailed somewhere., Since departure 
from expectation often depends heavily on one or two very improbable sample values, ы 
ations smaller than a, few units is apt to produce 
c erit of case (iii) is that it allows us to postpone the curtail- 
ment until expectations are well under one unit, Some results of § 4 below suggest that one 
could go as far as to include expectations of 0-05, if the number of cells is large and if there 
are also considerable numbers of cells with somewhat h 


igher expectations. 


(iv) Case of equa] Probabilities 
If we set p; = n- in case (iii), equations (10) become: 
G-n»Xo)|N-N, mg). n=l, тана = за 1) (N —1)/N. шы 
One merit of case (iv) is that it allows a correct; 


lon for continui 
n from (11) that 
between correspo: 


e value of Уд. Tt is see 


ion of 
ty, namely the subtraction 0 
values of G are proportionate to differences 


i д t 
differences between adj желе 
nding values of Xa?. Thus t 
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proper correction should be to subtract from the sample value of Уа? Ва the distance to the 
preceding value. If one is subtracted from a cell-frequency, say z,, and added to 


а smaller 
cell-frequency, say x», then Xe? is diminished by 


ritti- (а= 1)? (85+ 1)? = (x, a, — 1). 


It follows that the distance to the preceding value is at least 2, and that it is exactly 2 if 
there is at least one pair of cell-frequencies that differ by two units. If the cell-expectations 


are small, it is most unlikely that none of the cells will be empty and that none will contain 


ase the value 


just two observations, unless all frequencies are either one or zero, in which с 
of Xa? is the smallest possible. In fact, it appears from the exact distributions of G, 


shown 
values of Xa? 
proceed by steps of two units, up to a cumulative probability of at least 0-999. When 
expectations are not small, greater steps might occasionally be located in the critical part 
of the range, but then the number of possible values of G is large and the effect. of dis- 
continuity is small. It seems, therefore, that the subtraction of one unit from the 


in $4, with expectations 2, 1, 1, 0-5, and 0-2 respectively, that the possible 


sample 
value of Уа? is an effective correction where a correction is most needed. 
і 4 4 н 
It follows from equations (5) and (11) that, with correction for continuity: 

n(Xa;—1)— №? N(n—1 

ет „ш лу (12) 
N-—I —1 

Thus another merit of the case (iv) test is that the computations involved ате very simple. 


However, the most important merit is that it may be considered as an adapt 
, 
expectations of Fisher's test for Poisson distributions. 


ation to small 


If we wish to test whether the observations ty, ta, ..., 2, are independent Poisson variates 
having à common expectation, the standard test, as first given by R. A. Fisher (192 
to calculate EX(v—3) nX(a) 


am N 


5). is 


- А. 


This is the G of (11), the distribution of this criterion under the condition Ха; 
Same as the distribution of G considered above. 

It is well known that Fisher's criterion discriminates powerfully between a Poisson dis- 
tribution and a distribution with a variance noticeably greater than the mean, but poorly 
if the variance and the mean of the alternative distribution are also equal. Therefore, if the 
data have successfully passed the test of case (iv), it may not be superfluous to try the test 
of case (iii), fitting a Poisson distribution to the data and comparing the observed 


= N being the 


and 
expected cell-frequencies. 
(v) Case of n elements distributed over n cells with equal probabilities 
If we set № = n in case (iv), equations (12) become 
а.  R(Xaj—-n—1l1) 02 
Ju ағыны (13) 


This gives a test particularly suited for testing an assumed even distribution of 
an interval, if we divide the interval into as many parts of equal length ав there 

t sometimes happens that the end-points of the interval of observation of a 8 
бау, accidents are not recorded. Then take the times of the first and the last ace 


points on 
are points, 
equence of, 
ident as the 
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i i g i an- 
ints and for the rest, disregard these two accidents. In fact, if n+ 2 p 
cum мөө evenly distributed on an interval (а,0), then under the condition tha 
omly 


, S are random and evenly 
ositions of the extreme points are a’ and b he » other point: y vi 
pos "b th 1 
distributed on the interval (а > b № 


3. METHODS OF COMPUTATION 


+ 5 1 a 
ions i i i i i re so extensive that it 18 
The computations involved in case (i) of the preceding section are so exten: 


practical necessity to arrange them in some economical scheme. In the applications et 
we have in mind the number of rows (n) will usually be much greater than ds num : iei 
columns (m) and most of the row totals (2) will have small integral values. Lene i. : 
tity Хуу! may be computed directly from the Ур but for the computation of Ха i 


should then first be arranged in а frequency table. If», denotes the number of rows (workers) 
with z; — z, then 


Iz! = nz. 


. H 5 
It is economical to combine these computations with those for G. We shall give two schemes, 
one for m > 2 and one for m = 2. 


Scheme for m > 2 
(1) Write the rows of design 

(2) For each j, sort the strips according to ж. Note the frequency distribution of Xij 
its number, n, and its sum, y;. Pool the frequency distributions for all j and note the sum, 
and the sum of squares, Q. 


(1) on separate strips or on the upper margin of cards. 


(3) Sort the strips according to the row total, z. Prepare a table with as many rows 28 
there are different values of z and with 9% 


+ 5 columns, under the headings, 


(2) (2) (n)... (а)... (в)... (4)... (4), 
for each 2 writing under (п.) the numbe 


under (s) the sum of the Sj, under (4)) the 
of the q;. 


(4) Add the columns (n,), 


T of strips, under (sj) the partial sum of the 24) 
partial sum of Squares of the т, under (q) the pup 


э (83), +», (8) and (4) and check whether the sums are equal 50 
^, ..., Jj, ...,.N and Q, already computed under step (2). Write under the (т) the product-sums 
9; = (2) x (qj. 

Check whether Х0;-(г-)х(4) (written under (q)), 
Write under (2-1) the product-sum 

Ezy! = (2-1) х (n,). 

(5) Compute the sum Xy; 


* directly from the У; and the product-sum 


жауа = (улу х (0). 


using the equations (2), (7), (8) and (5). 
that the greater part of these computations can be done РУ 
eing reproduced automatically, using 0 7 
dure is illustrated on a numerical example in § > 
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Scheme for m = 2 

If there are only two columns, no separate strips are needed and the following simpler 
procedure may be followed: 

(1) Prepare, by simple scoring, a contingency table of the rows of design (1), with the 
two cell-frequencies as entries. Denote the different values of the x, by x and those of the 
Vi; by 2—2. The frequency distributions of x and 2 – 5 are found by addition of rows and 
columns. Note the total number n and the sums y, and y, of x and z—2. 

(2) Prepare a table with as many rows as there are different values of z and with five 
columns, headed 2 

e) (2) (2) (n) (x) (29). 


For every z, write under (n), (x) and (2?) the total, the sum of z and the sum of x? over the 
diagonal of the contingency table with the corresponding value of z. 

(3) Add the columns (n,) and (x) and check whether the sums are equal to n and UM 
respectively. Check whether 

М = (2)х (п.) = ys. (written under (2)). 
Write under (2-1) and (2?) the product-sums 
Уггі = (2) х(п.) and Q-(z7)x(a?) 
(4) For the computation of G use the equation 
G = N(Qui? ye!) - ya) 


Which is more convenient here than equation (2). Compute (©), var б, c, v and x? with 
equations (7), (8) and (5). 
This simpler procedure is also illustrated in $ 5. 


4. SOME TEST CASES FOR THE ACCURACY OF THE APPROXIMATION 

For designs restricted to two columns (case (i) of $2 with m = 2), and row totals, 2,, not 

greater than two, the computation of exact distributions for G is not unduly heavy. Since 

the approximation presumably improves for m > 2 and larger row totals, such designs are 

. Particularly suited for test cases. Let n be the number of rows, n, of which have row total 1, 
the remaining n; having row total 2, then 

Ny + 2ny = yit a = N. 
Of the cell-frequencies in column 1 with row total 2, let а be the number of those equal to 1 
and b the number of those equal to 2. Then the numbers of the five possible types of row are: 


Cell-frequencies Number of rows 


11 a 
2,0 b 

0,2 ny—a-—b 
1,0 4,—a-— 2b 
0,1 n,—y,T at 2b 
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The probability of a particular random sample is 
у! Yo! 24] М! 
and the number of possible samples with given a and b is 
n! Na! 


(y; — a — 2b)! (п, — y, +a + 2)! alb! (ng —a—b)V 
Thus the probability of a is 


Yr! Yo! n, no! 92 1 
P(a) 1 E 2 


= : (14) 
j^ b! (nj —a—b)! (y, a — 20) (n4 — y, +a 90)! 


Тһе criterion G is found to depend on a only, since 
а; afl 1\ 4b 4(n,—a—b) y,—a-—93b т-у +а+?2 _ aN 

I= + + m т uel 77 

ыў; 24% Y 2, 2 Yı Yz ^a 


It follows by equation (2) that a х: _ ам ) | 


2/1 


Since a proceeds by steps of one, the value corrected for continuity is 


Gs (i-es 2) 
ЕТА 


Using equation (14), the exact distribution of a зу 


as computed for two designs of the type 
shown in Table 1. 


Table 1 


Design: y, — Уз=20, ny = 


10, n, = 15, Design: Yı = 280, y, = 20, n, = n, = 100, 
N-40, п= 25 Ы Ыы 


М = 300, n= 200 


а Р(а) a P(a) a P(a) a P(a) 


a P(a) a P(a) 

0 | 0-00002 6 | 044071 | 12 0.01825 | з 0-000092 à n -09893 
1 | 000035 | 7 | 0418906 | 13 0-00464 | 4 | 0.00011 i porum = део? 
2 | 000255 | 8 | 0410840 | 14 0.00075 | 5 | 0.90060 1l | o34070 | 17 | 0.01976 
3 | 001142 | 9 | 016985 | 15 900006 | 6 | 0.00959 12 | 017585 | 18 | 0-00541 
4 | 0-03550 | 10 | 0-10378 7 | 00839 | 12 өшті! DUE 
5 | 0-08108 11 | 0-05054 8 


0-02276 


14 | 0314939 | 20 | 0.00007 


Total 0-99996 


Total 0-99993 


Application of equations (7). (8), (5), (1 
The comparison of exact and app 
are equivalent to those of (7, is sho 


(15) and (6) gives the results Shown in Table 2. 
Toximate cumula 


Е tive distributions of a, which by (15) 
wn in Table 3. 


= 


M Rc 


+ 


қ» 
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Table 2 


Design: y, = уз = 20, n, = 10, Design: уу = 280, y, = 20, 
na = 15, N = 40, n= 25 n, = п, = 100, N = 300, n = 200 


E(G) from (7) 24-615 199-67 | 
var G from (8) 15-289 300-73 

с 3:220 1-328 | 
>) from (5) 79-26 265-2 | 
u from (15), (5) and (6) 2:538/(39 — 2a) — 12-591 1-886 (221 — ба) — 23-01 


ТаЫе 3 


Design: y, = y, = 20, n, = 10, n, = 15 Design : y, = 280, y, = 20, пу =n = 100 
pe | unti n] 
a Exact |Approx.| а Exact Approx.| a Exact | Арргох.| а | Exact Approx.| 
0-000 | 0-001 8 0.659 | 0-624 + 0000 0000 | 12 ' 0.494 | 0.495 


0-000 0-002 9 | 0-822 0-834 5 0-001 0-001 13 | 0-674 | 0.676 | 
0.003 | 0-008 | 10 | 0-926 | 0-937 6 | 0-004 | 14 | O-s24 0495 | 
0-014 0-024 11 0-976 0-983 7 0-013 15 0-923 | 0-923 
0-050 | 0-062 | 12 | 0-095 | 0-997 8 0.036 | 16 | 0-974 | 0-973 
0-131 0-141 13 0.999 | 1-000 9 0.085 0-086 17 0-994 | 0-993 
0-272 | 0-275 | 14 | 1-000 — 10 0-178 | 0-181 | 18 | 0-999 | 0.998 
0-461 0-460 11 0319 | 0319 19 | 1-000 1-000 


3з © ‹л о юкб о 


In the first design the approximate 1% level of significance is correct, namely at a = 2, 
The exact 5% level is at а = 4, the exact and approximate probabilities of a being smaller 
than 5, being equal to 0-050 and 0-062 respectively. This design contains 20 cells with 
expectation 0-5. The second design contains 100 cells with an expectation of only 0-067, 

et the approximation is excellent for this design, the 1 and 5 % levels of significance being 
Correct, Our conclusion from these comparisons is that in the field of absenteeism there 
Vill be little occasion for serious errors, due to poor approximation, if case (1) of $2 is applied. 

_Vessereau (1958) has given exact distributions of G in three designs with № elements 
distributed over n equi-probable cells, namely for 


N=15,n=5, N=16,n=8, N=10, n= 10. 


Such designs, apart from being the only cases of one-way classifications for which the com- 
Putation of the exact distribution is practically possible, are of special importance, since the 
“Stribution is the same as that of Fisher’s criterion, assuming that Уз, = N is fixed om 
(iv) of $2). For our purpose, however, V. essereau's designs are not very critical, since their 
Values of № and т, if presenting numbers of absences and of workers, seem far below dim 
thing that will be asked for in practice, whereas expectations smaller than N/n = 1 wil 
."equently occur. One also wants to get some idea of the speed with which the approximation 
proves with increasing n in the important case where n = N. about its behaviour if n 
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nt of 
ly one has to take account of the amou i 
i i ant N and so forth. Unfortunately снема И ped 
2 aen прен еб in particular, in finding the exact distributions. For Ro cdm 
; : 4 
ys qnl increases rapidly with Ж; once it is done however for such a n Jie 
Mm сз 1 val fn is comparatively small. 
iti с і th the same N and greater values o 
additional work for designs wi астаса wenige до 
i i 7 i take Vessereau's second and thir g 
ch considerations we decided to | | кай 
a with N = 16 and n = 16, 32 and 80 respectively. For these, we compare the e 
esi - , | 
istributi ith i ximation according to (12). А 
stribution of G with its approxima : — түз 
di: Vessereau has described the following method of finding the exact roii ach deri 
a design of N elements distributed with equal probabilities over n cells: Let a әліп y^ ] "d 
over т be defined as an ordered set of n non-negative integers тү, such that = e^ E 
Po Pı Py be a sequence of non-negative integers. The union of all partitions w 
o: Pas e D 


artitions 
numbers equal to 0, ..., py numbers equal to N will be referred to as a class. АП partitior 
of a class are equi-probable, since their probability is 


n-NN! 
(212... QN oa 


The number of partitions of a class is n!|(po! ... Ри!) so that the probability of a class is 


nV М! (16) 
Po. Py) = (21р... (NT)?N pol... py!” 
7 
Let Q be defined by Q = Za? = 1+4. +... + N?py, | (17) 


A : : e 
80 that all partitions of a class have the same value of Q. Consider two designs with the Ея f 
value of N, the first with n = N and the second with a greater value of n. To every class 
the first design there exists a class of the second design with the same values of p, ..- PN 


and with a value of Ро that is n — № greater. Let q denote the value of Фо under the first 
design, then the value of fo 


of the corresponding class under the second design is q+” =a 
and corresponding classes have the same values of q and Q. Denote the probability of не: 
union of all classes with given values of g and Q under the first design of P(g, Q) and that o 
the corresponding union under the second design by Р.(9, ©). Tt follows from (16) that 


(Ny? N-N қ 
Р, Ы | 
№4, Q) 4! (20... (NI р... pei " 
P,a, Q) = (Nim а! 


1 (4+ 271 2099). 


" 


y 
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Design: М = 16, п = 32 


ТаЫе 4 
G | (в) | a | ре | © | рб) | в | во) 
Design: N = 16, n= 16 
| 

0 = 16 | 013914 | 32 | 000332 | 48 
| 2 0-00009 
2 | 0:00014 | 18 | 010746 | 34 | 0-00276 | 50 | 0-00006 
4 0-00310 20 0-06998 36 0-00157 52 0-00002 
6 | 0.02302 | 22 | 004922 | 38 | 000094 | 54 | 0-00001 
в | 0.07212 | 24 | 0.02740 | 40 | 000044 | 56 | 0-00001 
10 | 012829 | 26 | 001989 | 42 | 0-00020 | 58 | 0-00001 
12 | 017017 | 28 | 001172 | 44 | 000023 | 60 | 0-00001 
14 | 016260 | 30 | 0-00621 | 46 | 000016 | 62+ 0-00001 

1-00000 


ү мм...ммм"Ш"ШШЫЫЫ=6И] нь 


0:01040 
0:07343 
018562 
0.23396 
0-19905 
0-13970 


_ 0-07549 


0-04089 
0.02256 
0-00936 
0:00468 
0-00246 
0-00126 
0-00060 


72 


76 


0-00024 
0-00010 
0-00010 
0-00006 
0-00002 
0-00001 
0-00001 


1:00000 


| | 


Design: N = 16, n = 80 
64 0-20039 114 001344 164 0-00008 
74 0:36995 124 0:00427 174 0-00004 
84 0:25504 134 0.00193 | 184 0-00001 
94 0-10990 144 0-00051 194--| 0-00001 
104 0-04430 154 0-00013 
1-00000 
£K f 


The fact that the sum of the probabilities is ex 
Since no adjustment to that e 
Approximation (4) was not u 
(1922) Tables of the Incomplete 

abilities Z(u, p), starting from the 
which are related to the v and the x? 


ffect was made. Fo: 
sed, since four of the values of v were too small. К. Pearson's 


T-Function were used instead. These give cumulative pro- 
left, of а variate %, associated with а parameter р, 
of equations (9) by | 


3 


5 


actly one in all three designs is a coincidence, 


r the approximative distributions the normal 


р= 40-1, ш = (2. 
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ose, using у ti 2)werere laced DY 
For i inui ions (1 )and (12) ere rey 

i i the correction for continuit equa 

or this purp р 5 


N(n—1) NG-n 


v (20) 
Ру лу, “= JNa- DW i] 


Application of (20) gave the results shown in Table 5. 


Table 5 - 
| б, n= 80 | 
Design veto nat N=16,n=8 m п=16| N=16, n=32 | N=16, n= | 
E UM... — . | 
p 4 2-733 Е | 15-533 | 41:133 
ч 


0-24846(@— 1) 0-13802(2G — 1) | 0-18856(G — 1) 
| 


| i | 


NEL : : : - sof G. are shown 
The comparisons of the cumulative distributions, starting from high values of G, are 
in Table 6. 


Comparing the two designs with N = n, it appears that the approximation yet we 
moderately, if N increases from 10 to 16. In the latter design, the 1 and 5% levels 
significance are correct. In the former design the 5 % level is correct. Т 
1% level is 22, whereas the exact probability 


The error is hardly of practical importance. It m 
N 


= n, the approximation is good enough if N — 
with V 


ы А а 
et even in the last design the nominal 1 an 
- 124 and 114 respectively. 


t 
‚ each with expectation 0-2. We have seen before tha 


ign with 100 cells with expectation 0-067, 100 cells 
xpectation 0-93 and 100 cells with expectation L8 
It seems that the disturbing influence of small expectations is effectiv 
large numbers of cells, especially if the 


ely counteracted ur 
re are also considerable numbers of cells with some 
what greater expectations, 


with expectation 0:13, 100 cells with e 


5. APPLICATION оғ THEORY TO NUMERICAL EXAMPLES 
Example 1 
Р т 
(1939) of the accidents of 166 London bus-drivers pet 
years constitute a design with 5 columns and 165 rows, since one driver, vá 
had not a single accident, had to be omitted. So m = 5, n = 165. The yearly totals and th 
grand total are: 


Тһе records of Farmer & Chambers 
a period of 5 


i 9% | Уз | Yı | Ys N 
| ШЕ 
ші e 268 250 255 1330 


= 7 


be^ 


Design: № = 16, n = 8 (exact distribution taken from Vessereau) 


16 | 
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Table 6. Cumulative distributions, starting from high values of G. 
G | Exact pom G Exact | Approx. | G Exact |Approx. | G | Exact |Approx. 
= | | 
Design: № = 10, n = 10 (exact distribution taken from Vessereau) 

о | » | | 
2 1000 1-000 10 | 0-433 — 0-402 18 — 0-039 | 0-042 26 | 0-003 | 0-003 
ME 12 | 0-340 | 0-270 | 20 | 0024 | 0-020 | 28 | 0-002 | 0-001 
6 0-869 14 0-157 22 0-014 0-010 30 0-001 0-000 
8 | 0-657 0-071 0-082 | 24 | 0-006 | 0-004 | 32 | 0.00 | — 


1-000 
0-989 


0.732 | 
0-618 
0-479 | 


1-000 
0-986 
0:935 
0-846 
0-731 
0-607 
0-487 


0-384 
0-274 
0-209 
0-151 
0-112 
0-073 | 
0-056 | 


0-379 
0-288 
0-214 
0-159 
0-127 
0-079 
0-056 


16 0-038 | 
17 0:027 
18 0-020 
19 0-014 
20 0-009 
21 0-006 


32 | 0-004 | 


Design: N = 16, n = 16 


0-039 
0-027 
0-018 
0-013 
0-008 
0-006 
0-004 


ыы 


BIA C e 


| | | | | 


| 
6 | 


16 


0 
24 
28 


64 | 


г 
| 


1-000 
1-000 
0-997 
0-974 
0-902 
0-773 


1-000 
0-990 
0-916 
0-731 


1-000 


1-000 
0-999 
0-994 
0-963 
0-887 
0-762 


0-997 
0:973 
0-890 
0-730 


16 


20 
22 


0-603 
0-441 
0-301 
0-194 
0-124 
0-075 


0-609 
0-453 
0-316 
0-208 
0-131 
0-079 


24 0:047 
26 0-028 
28 0-016 
30 0-010 
32 0-007 


0-004 


36 | 0-002 
38 | 0-001 
40 | 0-001 
42 | 0-001 


44 | 0-000 


Design: N = 16. п = 32 
32 0497 0518 | 48 0041 0-036 | 64 | 0-002 | 0.00) | 
36 0.298 | 0332 | 32 | 0-019 | 0-014 | 68 | 0.001 0.000 
40 | 0-158 | 0-173 56 0-010 0-005 72 0-001 | -= 
44 0-082 0-084 60 0-005 0-002 76 0:000 | — 


0-966 94 0-175 0-201 124 0-007 | 0-002 154 0-000 0-000 
0-789 104 | 0-065 0-059 134 0-003 0-000 
0-480 114 | 0-020 0-012 144 0-001 0-000 | 
| 
= 


Biom. 46 
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Тһе numbers n, of the personal totals z are: 


| 2 n 
z n z | п, ы | п: И | 
| = 
| 
; 6 
1 2 6 17 11 | 9 16 | 4 
2 3 7 14 12 6 i ІТ 
3 14 8 15 d Е 21 3 
4 17 9 | із 14 6 E i 
5 21 10 | 13 15 1 | 
Total 165 


The sums of the inverses of row- and column-totals are: 


Ez! = 28-5788, Ууу! = 0-018881. 
The results of equations (7) and (8) are: 


Р = 143-762, д = 3-9880, 


(= 8121, 7 — 0-000084, 


ЕС) = 65655, varG = 1085-2. 


Using (5), the Parameters c and y ате: с — 1:210, v = 794-4, 
To illustrate the method of com 


two columns, the Personal scores 
period were ав follows (first schem 


putation, recommende: 


s an 
d in $3, for designs with more th 
of the n, = 6 drivers 


я tal 
with z — 12 accidents in the to 


е, step 3): 
а | om © m % Total 
ааа - - ee 
3 2 4 1 2 12 
3 2 4 2 1 12 
2 3 1 2 4 12 
5 2 1 3 1 12 
2 2 2 2 | 4 12 
1 3 2 3 3 12 
в, = Ушу: 16 14 14 13 15 72 
qi = Xa: 52 34 42 31 47 206 


m+ 
is given in Table 7, showing t 


^ 


) 


! 
! 
| 
| 
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Note that only the ‘totals’ of columns 3, 4, 5, 6, 7, 8 and 9 are real totals, the other ones 
being product-sums, as described under step 4. There is no ‘total’ corresponding to 
column 1. With these results, y? is computed as follows (step 5): 


100-425 783-909 80-047 73-708 | 73:428 


Sely z = 50р ^" 256 + 268 ' 350 ` 255 
= 1.5038, 
С = 1330 x 0:5038 = 670-1 (see equation (2)), 
x = 1-210 х 670-1 = 810-8 (see equation (5)), 


и = J1621-6—4/1587-8 = 0-42 — (see equation (6)), 


80 that the test does not reveal a significant departure from homogeneity. 


Table 7 
є 
Column .. | 1 2 3 4 5 6 7 8 9 
Heading .. | @ | ез ео | G@ | Gd | @) | Gd | @) | € 

Row (z = 12) 12 | 0-0833 ó 16 14 14 13 15 72 

pom — 

‘Total’ . | овбтев | 165 | 301 | 256 | 268 | 250 | 255 | 1380 
Column... 10 11 12 13 14 15 
Heading ... (a) (4) (а) (а) (as) (4) 

Row (2 = 12) 59 34 42 31 47 206 
‘Total’ 100-425 73-909 80-047 73-708 73-428 401-517 

— 


Example 2 
two columns, consider Adelstein’s data 


riods of 5and 6 years, cited from Arbous 
n the form shown in Table 8 (see 


s an illustration of the outline for a design with 
Accidents of 122 shunt-workers, specified for two pe 
st Kerrich (1951). The complete records are 8 

ер 1 of the second scheme of § 3). | m 

In agreement with the scheme, the number of accidents in the 5-year period fs denoted 
by * and that of the 6-year period by 2-2. Tf the difference between the periods makes 
бы Worth while, it saves some computation to take the shortest one for x. From the 8 
diagonals of this concise table the lines of the 5-column table are easily written down 
(Step 2 of the scheme for m = 2) (Table 9). 


ven i 


24-2 
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Table 8. Adelstein’s accident data. 


| 2—2 
| 1 
5 Тоба 
" 0 1 2 $ 4 5 | 6 
| 50 
0 21 18 8 1-9 1 2 
Я: 13 14 10 1 4 1 | А 17 
2 4 5 4 2 1 & 5 
3 2 1 3 2 м 1 2 
4 4 1 1 0 
: 0 
6 y : 
7 1 
3 E 122 
Total 40 39 26 8 6 2 1 
Table 9 
(=) (21) (п.) (x) (x*) 
1 1-0000 31 13 13 
2 0-5000 | 26 | 22 30 
3 0-3333 | 19 26 48 
4 0-2500 | 7 12 | 26 
5 0-2000 9 17 39 
6 011067 | 5 13 39 
7 0-1429 1 4 16 
8 0:1250 | 3 12 62 
| | 
n REN = | 
274 54984 — | 101 19 | 74836 
| 
| | 


In agreement with step 3 of the Scheme, on 


i E x 
ly the ‘totals’ of the columns headed (n,) and ( 
are sums, the other ones being product-sums, Hence 


m=2, n=101, N = 274, y; = 119, Yo= 155, X 


984 
07' = 0-014855, Уггі = 549 
G = 274 x (74-836 x 0-014855 — 119/155) 


i 
= 94-26 (step 4), 

P = 63-370, p= 0-99634. 
6 = 17-885, r= 0-00026, 
E(G)— 10037, varG = 92-99 (see equations (7) and (8)), 
с= 1:079, v= 108:3, = ТОЛ, us — 0-42 


(see equations (5) and (6)). 
Again there is no significant departure from homogeneity 
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Example 3 
We haveapplied the same test to the absences of the 248 shift-workers recorded by Arbous 
& Sichel (1954) and again there was no significance. The personal totals of these shift-workers 
will be used here to illustrate the test of fit of an assumed negative binomial distribution 
(case (iii) of $ 2). We denote the total number of workers by п, the numbers of workers with 
x absences in the total period by n, and the expectations of these numbers by f. = пр,. 
Тһе figures given in Table 10 are taken from Arbous & Sichel, table 1: 


Table 10. Data for absences from Arbous & Sichel. 


JEMEN | wy nj, 
NR — | | I» 
| | — 
0 7 | 11-898 | 0-09 4118 | 95 2 1668 | 0.60 жән 
1 | 16 | 16086 | 006 | 15914 | 26 | 3 1-452 0-69 6-198 
2 | 23 | 17-733 | 006 | 29-831 2 3 1-265 0-79 7.115 
3 | 20 18-072 | 0-06 | 22134 28 1 1.101 | 091 0-908 
4 | 93 | 17-667 | 0-06 | 29943 | 29 2 0-985 1-04 4-175 
5 24 16-829 | 0:06 | 34-227 30 — | 09833 | 120 - 
6 | 12 | 15-751 | 0-06 9142 | 31 1 0-724 1.38 1:381 
7 | 13 | 14553 | 007 | 11613 | 32 1 0-629 1-59 LEGG 
8 9 | 13320 | 008 | 6081 | 33 1 0546 | 183 1-832 
9 9 | 12.089 | 0-08 6-695 | 34 = 0474 | 2M WE 
| 
10 s | 10920 | 0-09 5-861 35 0411 2-43 2.438 | 
11 | 10 9-808 | 010 | 10-195 36 Eo 0-356 9.81 = | 
12 8 8-772 | 011 7.296 37 = 0-308 555 = 
13 7 7.817 | 0-13 6-268 38 == 0-267 3-75 = 
16 2 6-944 | 0-14 0:576 | 39 — 0-231 4-33 = 
15 | 12 6-153 | 016 | 23404 | 40 = 0-200 5-00 _ 
Ig 3 5-439 | 0-19 1-655 | 41 = 0-173 5-78 - 
2T 5 4798 | 021 5:211 42 = 0-150 6-69 E 
18 4 4.994 | 0-24 3-788 43 = 0-129 7-69 = 
19 2 3-713 | 097 1.077 | 44 =. 0-112 8-96 = 
20 2 3-259 | 0-31 1.227 | 45 = 0-096 10:37 _ 
21 5 2.857 | 0-35 3-750 | 46 = 0-083 12-01 = 
22 5 2.501 | 0-40 9.996 | 47 0-072 13-90 = 
23 | 9 | 2188 | 046 | 1828 | 48 1 0-062 1642 | 16-116 
24 1 1.911 | 0-52 o523 | 49+] — 0-386 2-59 t 
Total 248 248.015 12218 301-499 


According to equations (10) we have: 
Q —301499—248 = 53-5, E(G) = 49, 

var @ = 2x 49 — (2500+ 100 —2)/248 + 122-18 = 209-7, 

Ог, with the correction suggested under case (iii) of $2, 


of the distribution: 
E(G) = 47x 49/49 = 47, 


for the two estimated parameters 


var С = 47 x 209-7/49 = 201. 
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It follows by the equations (5) that 

c = 2x 47/201 = 0-468, v = 0-468 x 47 = 22-0, 
so that P = 0:3. 


X? = 0468 x 53-5 = 25-0, 


"There is no significant departure from the negative binomial peine елесің A 
account is taken of the fact that one worker had 48 absences. The iei aid auld be 
less if all expectations less than five or so were grouped, but the power of the in is still 
heavily reduced. Judging from the results of $4, we suppose that the hoodies 2 ier 
usable, although the inevitable grouping is restricted to expectations smaller than 


Example 4 
As an illustration of the testing of an assumed Poisson distri 
(iii) and (iv) of $2, we have selected the 
subperiod (see Example 2 above) 


for each subperiod *the distribut 


У 8 
bution, according to the ipie 

Е weal 
accidents of the shunt-workers in the 5 5 n 
; because Arbous & Kerrich (1951, p. 356) asserted 


: ily with the — — 
ion of accidents was found to agree satisfactorily with | 
Poisson distribution’: | 
aus Case (iv) Case (iii) 
" ^ А-1 nifi, 
а т, ат, шт, n. nz Е 
are: 
0 50 0 0 46-00 0-0 543 
1 43 43 43 44-87 0-0 412 
2 17 34 68 21-88 0-0 13:2 
3 9 27 81 7.11 0-1 114 
4 2 8 32 1.73 0-6 23 
5 0 0 0 0-338 3-0 00 
6 0 0 0 0-055 : 
: 3 s 49 ma 0-064 15-7 16:1 
— -1 E = =—— 
122 119 273 121.99 19-5 1381 


Beginning with the test of case (iv), we find accordin, 


g to equations (12): 
№? = (122 x 272 — 1192)/118 = 161-2, 


и = 119x 121/118 = 122.0. 
15 follows from (6) that 


и = 4322-4 — [243 = 2:37, giving P= 0-009. 
Comparing the columns headed n, алай, it will be зе 
wholly due to the worker who had seven 
workers having seven or 
Continuing with case (i 
and more, with an ex 


к ? distribution at a final class of € ^ 
pectation of 0-064, Using equations (10) it is found that 
G71881-122—161, HG) = 6, var = 12— (494414 2/199 4 19-5 = 31-0, 
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or with the correction suggested under case (iii) of $2, 


Е(а) = 5, varG = 25-8 
ала by equations (5), 


с = 10/25:8 = 0:387, v = 0-387 x 5 = 1-94, А = 0:387 x 16-1 = 6-28, 


Which is just significant at the 5% level. 

Application of the classical test requires a further curtailment at a final class of x = 4 and 
more with expectation 2-1. The result is y? = 2-13, v = 3, which is not even significant at the 
30% level. Тһе strong evidence of the worker with seven accidents is completely destroyed 


if only the information that he had four or more accidents can be used. 


Example 5 


As an illustration of the application of the test of case (v) of $2 to an assumed even dis- 
tribution of points on an interval, we take the 108 explosions in British mines, in a period 
of 26,263 days, recorded by Maguire, Pearson & Wynn (1952). Тһе period is divided into 
108 subperiods of 243-176 days each: 


di Ir 

т n. TR. xn, ny 
0 44 0 0 39-7 
1 37 37 37 39-7 
2 17 34 68 19-9 
3 5 15 45 6-6 
4 3 12 48 ) 2.1 
5 2 10 50 = 

108 108 248 108-0 


According to equations (13) and (6) we have: 


X? = 108 x (248 — 109)/107 = 1403, у = 108, и = /280-6 — /215 = 2-09, 


Biving P = 0:02. The approximation is certainly good for N — » — 108, since according to 
$4 it is already good for N — n — 16. 

Again, the last column gives the Poisson expected group frequencies й,, for a mean of 
1-00. The standard X? test gives y? — 5-46, v = 3, which is not significant at the 10 % level. 


6. IMPORTANCE OF TESTS OF HOMOGENEITY IN STATISTICS ОҒ ABSENTEEISM 


With the rising standards of life and social security the importance of psychological and 
Sociological grounds of human absence behaviour seems to increase in relation to purely 
Medical reasons. The urgent need of a rational control of absenteeism calls for more know- 
*dge of this complex phenomenon. If statistics are to be of any service for this difficult 


Problem, some statistical model must be accepted first. The following is an extension of the 
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del invented by Greenwood & Woods (1919) when dealing with industrial accidents. For 
ere ar ference see the extensive review of Arbous & Kerrich (1951). 
a = dad every worker has a private urn, with many white and few black balls, that 
"e о drawing a ball with replacement and that he has an БЕЙНЕ, E om ien 
absence, if he draws a black ball. The proportions of black balls in the urns may be di 


2 Ehe workers 
They are associated with the personal liability for accidents or absences of the worker 


z а А ' ed of 
concerned and with the risk and stress of their work. The same could be said of the spee : s 
drawing balls. However, it would make no difference if the number of black balls of a certa 


worker were halved and his speed of drawing were simultaneously doubled. So we are Е 
to associate the proportions of black balls and the speeds of drawing to different groups 0 

causes. Let а worker's proportion of black balls be associated with his personal character- 
isties and his place and function in the concern, things that are relatively constant in time. 


Let his speed of drawing be associated with factors that are relatively variable in time anc 
may roughly be divided into the follow 


A. Homogeneous factors, that are the same for all workers of the observed population, 
such as weather conditions, very contagious diseases, economic activity, etc. 

B. Inhomogeneous factors, that тау 
moderately contagious diseases, change: 


SER. а> | А rn 
of officers who are sociologically important, development of attitudes towards the concern; 
measures to improve working conditio 


ns and so on. 
Since Greenwood first discussed th 


ing groups: 


Е А ее ch as 
be different for different groups of workers, such # 

м : ; . ange 
s in production scheme, in personnel policy, chang 


y selected, homogeneous populations, place 
orkers and relations being such that the ME. 
uspected of being different for different groups e 

Workers. As a result of those Studies, it seems to have been established that the liability 18 
t we may regard the Proportion of black balls, multi- 
plied by a suitable constant, as having approximately а, gamma-distribution in such homo- 
utious assertions, see Arbous & Kerrich (1951). The 


is also varying in time. Tf those fa 


at 
tiplied by a number the 
plier will also с 


arly for all workers, the multi- 
all workers, that is, it will be equal for all workers at 
his case, So, i 
number of drawings and the distrib 
degrees of freedom would be hom 
gives an insignificant result, it m 
factors (B). 


п any subperiod 
ution of black ball x n-design with (m — 1) (n. — n 
: s 
Sense. Conversely, if the p 
ау be concluded that there is no reason to investigate t 


behaviour in relation to the others. Severa] Statistica] 
which of the factors (B) i i 
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has to consider in which other respects these two groups are different. At the present stage 
of our knowledge it will often be difficult to find a reasonable explanation for an apparent 
Statistical inhomogeneity. A reliable diagnostic system can be built up only if a large 
number of tentative diagnoses is already available. 
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JUMP ANALYSIS 


Ву С. Н. JOWETT ann WENDY M. WRIGHT 
Department of Statistics, Universit y of Sheffield 


l. INTRODUCTION 
When a length of time series is such as to fall naturally 


р TE ; A statistical tests 
sisting of n evenly spaced successive terms, it is often desired to carry out statistica ia 
to determine whether the probability structure of the series is unaffected by the tr ans ^ 

i i "lati thin sections 
from one section to the next or suffers a change in mean. If variation within s ate 
random, the conventional analysis relevant to this issue would be an analysis of varia 


ithi i Ч : I hat red, using 
between and within Sections, the observed variance of section means being compared, 5 


; ^ ance within 
the P-test, with their hypothetical variance, estimated from the pooled variance ien 
; ; 1 
sections, under the null hypothesis of a common underlying population mean. In man} 
applications, however, the structural assumpti 


on of random variation about a mean varying 
from section to section is replaced by that of sections of an autocorrelated series subject cd 
different displacements of mean, potentially due to assignable causes, from section 10 pol 
tion; when this is so, the analysis described above is no longer valid, and a modified or 
Substitute form of analysis must be found. Ur 


into k successive sections, each con- 


nfortunately the analysis of variance pattern 

of test leads to two difficulties. y 

The first of these arises because the observed means of different sections do not vary 
independe: 


ntly under the null hypothesi 


e mean semi-squared diffe 
ation can only be determin 


ays involves correlational propertie: 
greater than the length ofa secti 


since terms separated by more t 
be inferred from the within secti 


. T4 
à А led the between-section jump re + 
emi-squared difference (or jump) between the means of the la 
f the following Section. 
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locally Markoff variation (defined and discussed in Davies & Jowett (1958)). Under favour- 
able conditions, and for suitable choice of а, it will be shown that the statistic 


J, = (В.Т, d) 


(where W, is the mean semi-squared difference of all comparable within-section jumps) has 
expectation zero and approximate asymptotic standard error given by 


б. -/ Ган (2) 


under the null hypothesis, and provides the test of significance required. 


2. SAMPLING VARIANCE OF B IN TERMS OF SERIAL VARIATION PARAMETERS 


Denote the series by x(t), the £n observations being given by ¢ = 1, 2, ..., kn and having 
structur _— 
aan a[(s— 1) n 4-u] = p, E(5— 1i и), (3) 


where &(¢) will be taken as a stationary series for purposes of exposition, though this assump- 
боп may be relaxed and the techniques developed in this paper applied to the more general 
class of smoothly heteromorphic series (following the principle established in Jowett (1957)), 
where only the local variational properties of E(t) are effectively stationary. . 
Тһе second-order variational properties of £(!) are defined by the serial variation function 


6(т), given by the formula б(т) = EEEO -E+ (4) 
а symmetric function оЁт related to the more commonly used variance с? and autocorrela- 


tion function p(r) by the equation 
m 6(т) = e*(1—p(7))- (5) 


Denote the average of a number ж of successive values of (6) centred at t, called a central- 
ized average, by E,(t); thus, for example, £,(13) is the mean of £(1) and &(2). For successive 
values "| 2 t) is itself а stationary series, with second-order variational properties defined 

э оа 
by its serial variation function 
зыт) = EGIE GO -EUDP m 
Which for convenience in printing will be denoted by д.(т). It is related to ó(7) as follows: 
д.(т) = І X 5 E(E(t -3—4a- m) -&t-37de € 7m] 
| тнв ))-£t—4— $e 74 m")] 7 
x[t- d atm) K " 


$ $ веет 


>. _ yeu posure) 60 enne тур 
sel _4—fatr+m)—S- %-4о-тур 


be 


+4 
е3 да+ т) Eade emp) 
з ж [3m —m' 7) - 20(m m) 6mm + т), (8) 


1 
502 1 m’=1 К 
н p Hen ( м ar 
а-1) 1 Е" 1— (r). (9) 
| S ( -H) б(т+т)— За,--(а-1) а 
a 


20 r=—(a-1) 
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We may interpret 6,(7) as a repeated centralized average of ó(7) adjusted by a X 
to make 6,(0) identically zero. In most practical applications ó(7) has a M de ^ 
i onotonically, for some distance at least; д,(т) behaves somewhat similarly, ex | 1 
roin the cusp and curvature near the cusp are somewhat less pronounced. In other wor am 
6,(7) is a smoothed and translated version of 5(7). To evaluate (9) exactly it is clearly pos 

sary to know д(т) up to the lag r--o— 1, the values round т being particularly critical. 


The expectation, under the null hypothesis, of the semi-squared difference between the 
means of the sth and s'th section is 


к= ) 
0, (s—s' n), (10) 
but the serial variation statistics 


d(T) = Av Ma(t) — x(t 4-7]? (11) 

required to estimate ó(r) may be computed from within-section differences only up к: 

т=п-1. It is thus evident why such expectations cannot in general be estimated anc 

why the analysis of variance test based on them cannot in general be 
The between-section jump statistic B, is given by 

1 


k-1 
& 12 Bas +4 +402) — 2, (n 1 32)y. 
8-1 


applied. 


е — 


Under the null hypothesis that Ks is constant for s = 1,. 


E(B,) = 8,(2). 


..,k, we have 


(13) 
у stochastic series y(t) it may be shown (cf. Jowett, 
cov [2(y(t)— y(t-- 7), Ko) - yl’ ту = 3[AT9,( 07), (1% 
Where A7 is the second central difference operator at inti 

Applying this formula when y= 


Now for a general normal stationar 
1955a) that 


г егуа] т. 
Ša under the null hypothesis of constant Hs we have 


1 k k : T 
var B, — 13, „2 МА:д,(8 5 Ы 


(15) 


"WM LM j 
- E, X us [A28, (mn)? (16) 
Lats if S 
= ku +2 n). say, (17) 
m " 

where Ф, = (1 1) [A28,(mn)]? (18) 

[m| %а-1) 1 lr 2 

e (122184. ы " 
(-/” i} |23" 201-0) Aim es, аз) 
the form (19) being obtained from (18) by Substitutin: ich i$ 
9 hich 

annihilated by the differencing operation. 8/50, Un тера deo od 


The first term in the bracket in (17) may be calcul 
as far авт = 2x — 1, to estimate which Serial variati 
section comparisons provided that а < in. N 
valently, 6,(mn)) аге available for 


ated from values of 6(т) which go only 
on statistics are available from within- 


linis 9 direct estimates or 9 (mn.--r) (or equ" 
| à И > 1, but in principle at | differences 
in (18) may be estimated by averages of all Possible pro ple at least the second 

at lag mn +r, since if 


duets of within-section difference? 
вт = (E pr- 42) 5, 474 Jory 007—9) врт hay], CO 
EXp,lt,7)} = Ал (т) 


1 (21) 
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whenever 2, # +æ come from the same section and t 47, #+т-- come from the same section. 
The difficulties attached to this procedure arise through the necessity for graduating these 
averages by a suitable formula (cf. Bartlett, 1946) before substituting in (13). 

It is also useful to have a formula for the asymptotic logarithmic variance of B,: 


k-2 
var(log, 2.) iL (2+ ХУ), (22) 


k = m=1 
m 9, ә 
"7 В.Р’ oe 


where Ж 


since the essentially positive nature of B,, coupled with some evidence from sampling 
experiments, suggest that log D, will have a sampling distribution closer to normal than 
that of B,. The null hypothesis may thus be tested by comparing 


log (B,/0,(x)). (24) 


with a suitable multiple of its standard error, which is the square root of (22). 

Since the variance of the location parameters д, is estimated by subtracting from an 
observed value B, its expectation under the null hypothesis, given by (13), the asymptotic- 
ally optimal value of ж from the point of view of power is that which proves to give the 
Smallest value for (17). In the general case a special ad hoc investigation is called for, though 
consideration of the limiting cases of strictly random variation within sections at one 
extreme and smooth continuous variation at the other suggests (as confirmed in $3 below 
under the more restrictive assumption of locally Markoff variation) that smaller values of 
2 are likely to be optimal for smoother series, and vice versa. | | | 

The general technique just described is both laboriots and difficult to apply in praetice; 
two forms of simplification are possible. The first, discussed here, applies if 0(т) satisfies 
Certain smoothness assumptions; the second, discussed in $3, applies if б(т) шау һе repre- 
Sented approximately within a suitable interval centred at 7 — 0, by a function of the form 

001—0"). (25) 
but it is necessary to fit the serial 


f ‘суе i > first case 
The assumptions are less restrictive in the first case, 


lati i ў se this is avoided. 
Variation function д(т), whereas m the second cas І n 
The statistic B, is a member of a class of statistics called А the gener al 
Sampling properties of which have been discussed in Jowett (1 955a). Itisa коа o bond 
comparisons being a mean square of differences involving adjacent groups of terms of the 
" а 


; : а ке dominated by terms involving б(т) for values 
Series refore its s ling formulae are 
es, and therefore its sampling al time interval covered by the data. The neces- 


Of r whi М red with the tot 4 5 à 
which are small compare increasingly linear as 7 increases, in the sense 


за, s м it shall become 1 2 2 : 
ту condition on 6(7) is that it s (т) also becomes increasingly linear and 


deser; ; 55 [ith the result that 6, : 
re MU онай enm mute? so the smaller the value оҒа) as |7| inereases. 
apidly A д, 


Ф, of (17) make contributions which decrease rapidly 

анау М "m іп the bracket, and the 

: era ced ted by the first term in Я е 

as m increases. The whole of (17) will be domina ee she choice of x. Havi r'o- 

Size of this will furnish a good guide for the optimization of the choi y | : v- mg pro 

Visionall ho æ in this way, the practical statistician may reasonably follow the course 
ally chosen У, 


лее firs ‚о of Фу, Фә..- > Ф, ,,unless there is positive 
i i e first one or two 01 "p ^2 à : 

as gl cting all but к the fir n Pied aiv fish 

idence to warrant either t ойо 1 æ sufficient to just fy the 


Omission, 


А. 6, (т) converges г 
fthis condition is satisfied the terms 


heir retention or à red 
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Such evidence of a rather subjective kind may be furnished by scrutiny of the — 
constructed from the statistics d(1), ..., d( — 1), which muy give some idiestion of — 
upper bounds for the second differences Azó(n Pag 1), AZd(n+a—1), which straddle the 
range of second differences Ағ ó(r), from which A7 5,(n) is computed аз a weighted average. 
More objective evidence would be the significance of a relevant mean of the product 
Palt, mn) of (20); this is essentially a straightforward mean of terms from a time series, albeit 
generated from &(¢) in a rather complicated way, and its standard error may be estimated 

by the general method given in Jowett (19550). Neither method is particularly sensitive, 
but at least they provide some opportunity to check the working assumptions. 

Usually advantage may be taken of the availability of a direct estimate of the dominant 
term of (17) which does not require explicit consideration of the serial variation curve- 
Corresponding to the statistic B,, which is the sum of semi-squares of what might be 
described as between-section jumps, is a statistic W, which is the corresponding sum of 
semi-squares of comparable within-section jumps, defined by 


k n—« 


- шыту X Es- Int} tutga) -z,(S-Im4-3--u— 42). (26) 


=1 и=а 


a 


Independently of the truth of the null hypothesis, 
E(W,) = 8,(2), e 


so that W, provides an estimate both of the expectation of D, under the null hypothesis 
and of the dominant term in (17). On general grounds the relative error of this estimate may 


be expected to increase with a, being the same as that of B, for о = 3n, and this must be 
borne in mind in using it in 


e (17) as a basis for a large-sample test of significance. More 
explicit consideration of the error of W, under the assumption of locally Markoff variation 
will be given in $3. 

Although the exposition above is strictly applicable only under the assumptions that E(t) 
18 stationary and normal, the test is ro der departures from these assumptions in 
tions ү | 1957) that formulae for sampling properties 
of local statistics in stationary series could be applied as an approximation in non-stationary 


ion, he implications in the present 
case are that formulae such as (17) may be applied when sva plications in p 


1 - riation in £(t) is locally cci 

erva s comparable in length to the maximum lag т for which 
we torma oE thp formula are non-negligible. The rapid convergence of (17) ні vulnerable 
mainly to short-term periodic tendencies of autoregressive type, аще: be taken as 
a multiple of the quasi-periods. 3 вв а. can 


in (17) to be neglected, it becomes possibl 
calculating the jump criterion J, defined 
value of zero, using its asymptotic sampli 
2/(k—1). In this form the test itself becomes extr 
being rather one of assessing whether the conditio 


he test of significance for В, by 
by (1) and testing it against a null hypothesis 
ng variance as given by (22), which reduces 0 
emely simply to carry out, the difficulty 
ns necessary for its validity are satisfied 
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Also, considerations of power may dictate a value of x larger than that for which they can 
be satisfied. 

To explore the potentialities of J, as a test statistic, a numerical investigation was carried 
out using the two parameter Markoff serial variation function: 


0(т) = 0(1— p). (28) 


Numerical values taken were p = 0-0(0-1)0-9, 0-95; n = 12: E ce: a= 1 (1)6. It was 
unnecessary to take numerical values for 0, which as a scale parameter does not affect the 
sampling properties of J,. 


From (16), for large k 1 += " 

var B, ~ i У d[N9,(mn)]?. (29) 

Si 1 1 k k папа ТЕЙ; бс» 
Similarly, var W, = — 1&5 У У Х.МА40,8-ғп-ч--Ур, 30 
` SEDE k(n — 22 ly. s'—l и-а u'-a є Л (30) 

1 +o n-a S MA à4 
T в У DS X szd mn- u+ u’), 
пеге mite ЖЕР n dE (31) 
1 k-1 k n-a к ---, н 

cov (B,. И’) X XE X MA24(6-sncu)p, (32) 


= ДЕ 1) (n а) 2,2 ime 
ШЕМЕ Сей у "У HAZ, (mn Tau). (33) 
k(n — 2a + 1)m--o waa” 
Formulae (29), (31), (33) were computed in the cases described, taking 0 = 1, and the 
results substituted in the formulae 
var (B, — W,) = var B, — 2 cov (B, W,) + var W;, (34) 


var (DB, — W.) Е 
var Jy ~ [EOV Sa (35) 
to give the results in Table 1. 
The following broad conclusions emerge. 2. ' 

(i) Below р = 0:6, kvar (В. = И.) decreases to a minimum at or near « = 6 (i.e. in); 
above р = 0-6, it increases from a minimum for g = 1 (i.e. the smallest possible value of a). 

his indicates which values of « give the most powerful test. 

(ii) Тһе pattern of rise or fall in k var (B, — И) with a strongly resembles that of E(W.), 
except for p = 0-6, where variation in either is small. This confirms the appropriateness in 
this case of the practice recommended in $2. namely, choosing as the optimal « the value 
Minimizi 7 

(iii) M be kvar J, < 2:2 when a < 5 (actually when « < 6 for p < 0-3); at and 
above p = 0:6, kvar J < 2-2 when « = 1. Thus if we are prepared to content ourselves with 


a Slightly suboptimal value of а in certain cases, we can cover ourselves against errors of 


the first kind at least to the usual extent by working with the formula (2) which provides 
Ч With a simple test along the lines described at the beginning of the section which is not 
restricted to small values of о in cases where a large value of ж would be optimal. The factor 
72 isa convenient round number which is not unacceptably Vago. 

(iv) Tn the cases mentioned in (iii), the error committed by neglecting all but the dominant 
erm in (17) proves to be negligible. Evidently the increase of the factor from 2 to 2.2 ig 


Pecasioneg by the need to allow for the sampling error of W,. 
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Table 1. Variances and covariances of jump statistics for Markoff serial 
variation function with n — 12 


а жез”) 
" 0-95 
E | 0-0 | 0-1 | 0-2 | 03 | 04 0-5 0-6 | 0-7 0-8 | og 
S " 
k var B, 
z -005 
1 2-000 | 1-620 | 1-280 | 0-980 | 0-720 0-500 0-320 0-180 0-080 | 0:020 О 
2 0:500 | 0-540 | 0-558 0548 | 0-508 — 0-440 0-346 — 0237 04197 0.038 1 0:019 1 
3 0922 0268 0315 0-357 0385 0-389 | 0359 0290 041821 | 00657 0-031 3 
4 | 0-125 | 0160 | 0-200 0-246 | 0-292 0-329 0-344 0.316 7 0-2332 | 0-096 4 0-044 4 
5 | 0-080 0106 0-138 0-178 | 0225 0274 0.3151 0.3262 |09744 | 0.1306 pe 
6 0.056 | 0-075 | 0-100 | 0-134 | 0-177 1 | 02307 | 0286 2 | 0-3296 0-313 10 | 0-167 10 : 
(Percentage increase over dominant term alone shown in italics) 
k var W, 
.001 
1 0265 0.202 0-152 0-111 0078 0052 | 0-032 | 0-018 0-002 | T c 
2 0-088 | 0-096 0-100 0-100 0-094 0-082 | 0-066 0-046 | 9-008 0001 
3 | 0-058 | 0-074 | 0-089 | 0-104 | 0-116 0-120 0:114 0-095 0:023 0: 18 
4 0445 0-061 | 0-080 | 0-102 0.196 | 0148 046100457 | | 0055 |00% 
5 0:047 0065 0089 | 0-119 | 0-156. 04197 0-235 | 0-253 | o103 | е 
6 0-056 | 0-075 | 0-100 | 0-134 | 0-177 0-230 — 0-286 — 0-329 [0167 1 0-09 
k var (B, — W,) 
1 2:083 | 1-701 | 1-354 | 1043 | 0770 — 0-537 | 0-345 | 0195 | 0-087 0022 | gm 
2 | 0.519 | 0-563 | 0-586 | 0-581 | 0-545 | 0:477 | 9381 | 02605 | 0145 | oos | 00 
3 0-231 | 0-281 | 0331 | 0-378 |0412. 0423 | 0400 | 0.333 | 0-221 0-083 pis 
4 | 0123 | 0-158 | 0199 | 0-246 | 0-294 | 0-338 | 0368 0.363 | 0.299 0-140 Dr. 
5 | 0-074 | 0-098 | 0-128 | 0-168 | 0-218 | 0-281 0-353 | 0-418 0-414 0218 qe 
0-056 | 0-077 | 0108 | 0-151 | 0-212 | 0.298 | 0-412 | 0-536 0-574 | 0.325 0-1 
| E(W,) 
1 1000 | 0-900 0-800 0-700 0-600 0-500 | 0-400 050 
‘ M ө t à | 0300 | 0. Го: 0:57 
2 0:500 0:320 0.528 | 0-523 0-504 0.469 | 0.416 0-344 т ls "t 0072 
| 2 |0808 006 | 0597 | 0-422 |0439 | 0-441. | 0424. | |10580 |0198 |0097 
£ 102501 0282 | 0-817 | 0-851 | 0382 | 0-408 | оаа |е 10202 |0909 | % 
5 |0200 | 0-230 | 0-262 | 0-298 | 0-335 0:370 -396 o |0548 |016 145 
к | "M | | 0-396 0-400 0-363 | 0-248 0 
6 | 0167 | 0193 | 0.224 | 0258 | 0297 | 0-337 | 0.373 | 0-395 0,378 біле (0 
EQVLEQV,) 
1 | L000 1-000, 1000 1-000 1-000 1.000 o 
00 1-0 : 0 
2 | 0-500 0-578 0-660 0-747 0-840 0.938 "en d Pro: күре E 
3 | 0333 0407 0-496 0-003 0-732 0.882 1-060 147 |1960 | 1:380 1:940 
4 |0250 0313 0996 | 0-501 0637 |0612 | толь rd Bie on 2440 
5 |0200 | 0-256 | 0328 | 0-496 | 0-558 | 0-740 |090 | 1320 | 1990 | 2160 | 90 
6 | 0-167 | 0-214 | 0-280 | 0:369 | 0-495 0-674 | 0-933 | 988 | 1-816 2.480 3.340 
| l 1:317 | 1-890 2-750 
k var J, 
| I | 208 | 210 | 212 913 | 9.14 2.15 р 0 
2 2-07 2-09 2-10 2-12 2-14 ems E | 2-16 2:17 | 2-18 22, 
| 3 208 210 |210 212 24 (әм 22» |794 |929 2-35 270 
4 1597 1-98 199 2.00 | 2.02 9-06 Bde | 2:30 2-43 2-60 3.17 
| 5 184 | 185  L86 | 189 |194 948 ar Ек | 2-61 3-00 13 
| 6 200 | 206 2415 |226 |241 263 29 "is | 3-14 3-55 Zot 
| | 3-44 | 4.02 4-30 Ж. 
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Тһе numerical investigation just described involved a considerable amount of com- 
putation and yet covers only one particular case. То justify application to other cases it 
was necessary to examine the relevance and robustness of the conclusions drawn under the 
following conditions: (а) departure of the serial variation function from strict Markoff form; 
(b) departure of n from the selected value n = 12; (c) the value of k not large. 

(а) If the serial variation function may be represented in the form 

(т) = 0(1— p) + (т), . (36) 
where o(r) is negligible for r < 22-1 and has a second derivative so small that AZw(r) is 
negligible for т > x. the effect of the term (т) on (21). (23) and (24) will simply be the 
Addition of an average of these negligible quantities. 'This more general form of serial 
variation function. described as locally Markoff and discussed in a wider context in Davies 
& Jowett (1958), appears to have sufficient extra flexibility to serve as a robust working 
assumption in the absence of quasi-cycle variation, analogously with the normal distribu- 


tion in classical theory. 
For example, the function 
5а) = 0-81(1 — 0:87!) + 0-19(1 — 0-2") idi 


illustrates a common type of divergence from the strict Markoff form, namely a slower 
decline to zero in the initially steep gradient, and is roughly representable in the form (36) 
for p = 0-6. The sampling parameters of Table 1 computed for this function proved to be 
Similar in essentials in their pattern of variation, being nearly optimal at 1, E(W) varying 
little, and var J being less than 2:2 for @ = 1 (and also æ = 2,3). — | 

The robustness of the assumption of locally Markoff serial variation is of course greater 
the smaller the value of æ; however, its use need not be restricted to small values of a. 

(b) Supplementary investigations similar to that yielding м L БЕЛЕ n = 6, 
% = 1,2,3, p = (0-2)2, (0-6)? and (0-8)? and n = 24, 2 = 1, 8, 10, 11, 12, р = 402, 0-6, J0-8 


Were carried out to test the extension of conclusions (i)-(iv) to smaller and larger values of n. 
alent to 0-2, 0*6, 0-3 in terms of section length. 


Тһе val eg 
ues of p taken are equiv? : "ON 
It was found that, аз before, the patterns of kvar (В, — И) and Е W,) were similar; « was 
Optimal at unity for the highest values of p and nof greatly suboptimal at one less than its 
Maximum for the lowest value of p. 
. There was also evidence to suggest 
in terms of section length increases with п, 
gth incre: н Е 
% = 6, above it for n = 24. Also 2:2 remained à safe approxima 
Sponding cases. 
(c) Asa rough check on the app 
ы to gain experience, jump апа 
arkoff series in each of the cases: 


hat the transitional value of the Markoff parameter 
the middle value being below the transition for 
tion to k var J, in the corre- 


asymptotic theory for values of k not large 


:cability of the 
asit t for twenty artificially constructed 


lyses were carried ou 


3: n6 k = 10; 


as consistent with symmetry, satisfying a x? 
o out of the sixty values fell above the 


р = 
м, distribution of J, Wi 
bred d that only tw 


Soodness of fit test for a normal curve. re: 1-645 J[2:2/9])- 
Ччаѕі порта] upper 5 A point at 0-813 (= У — 


25 
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T xs m afd. әй 

The formula (2) is therefore proposed as the usual criterion for the 2... 
ae 1 hich is adequate to cover the non-va 

it makes a small allowance which is q | Pp ina 
cee аты E in the appropriate Marko 

ioni the error of W, at least in the pI ; 

f the summation in (16) and Е | ) Ше 

к= which in the absence of evidence to the contrary, may reasonably be taken to 
an 


other cases, or at least will be better than making no allowance at all. 


4. COMPARISON WITH THE ANALYSIS OF VARIANCE TEST 
Since only adjacent sections are contras 


st some loss 
ted in jump analysis, one would expect some | 
of power compared with a method ba 


sed on the same principle as the analysis Sigon 
where the means of all possible pairs of sections are effectively com pans. -— iiy 
approach is, of course, not usually possible when the serial variation curve is not ч атағы 
completely; nevertheless, the comparison is still of some interest, and we shall = Tofa 
large k and an assumed random normal variation in the parameters ji, characterize 


н н : and 
iati w taking the serial variation curve as being unit Markoff in ved см 
ing n = 12 as in $3. Under these conditions, writing В.о as the value of В, unde 
null hypothesis, we һауе 


DBs = 6-12, E 5 (nga da) = Ene 4 — Jay] (и, Heal 
8-1 8-1 


(38) 
The covariances of the three terms on the right-hand side of (38) 


of the first is given by (17), and the variances of the second and 


"jance 
are clearly zero; the varian 
given respectively by 


3 be 
third are easily shown to 
39) 
0 (3—4) к 
4 
and 2(k— 2) 0423,(0) — A28 (а) + 2028, (n), ( 
so that asymptotically 


1 
kvar В, ~ kvar Bat 307 + 205 [A70,(0) — A20, (п). Bu 
Also 


42 
ЕВ.) = E(B) +02, di 
80 that the estimate of ст given by 

Estas = В,— EB.) = B,- EW) em 
has the same sam 


pling variance as В,. 


. H 5 
Alternatively, we have an estimate from the caleulated variance var &, of section mea” 
Be ts B ч 4) 
given by Est, ой = уал, — Elvar, A (4 
Hence, similarly to the preceding argument, 
var (Est, o?) = var (var,z,) 
A 8) 
= var (var ё ) + var (var, ji) 4.9 уат [2 cov, (Е, , и), T 
where the suffix c implies that the varia; 


nce or covaria; 


: опе: 
5 nce is a calculated, not expected. 
Now, using formula (5) of Bartlett (1946) we have ыша d 


var (var, Ё) ~ *: (2, (оо) —Óz, (sn)y? uis 


and it is a standard result that var (var, и) ~ 201 
E 
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For the third term 


pe 


var [2 cov, (£,, 4)] ~ 73 E *: 
LE Elem td амды | 
s 14 g—1 


k? TE 
4. ® os " 
" BÀ ci var[E,(s— 1) n 3 4n)] 
(48) 


ES 


a(z, (00) — 9, (0). 


= 


We hav 
e used th for А " 
Ені. о? ese formulae for asymptotic varia 
4, 0, relativ 2 nces to compute t 5 
Тһе results = pie Est, оз for n — 12 and a number of different icd i a n z 
re given in Table 2. They all settle down to a limit of 67%, ie. Ф 5 and с. 
0,16. 3 aS 7, > оо, 


Table 2. Asymptotic efficiency of estimate of o? from B, compared with 
that obtained from vate s 


с? S 

S р= 0:0 р= 01 р= 0:6 = 0:9 
(unit = 02) Opt.a=6 | Opt. &= 6 | Орь. а= dpi. и pe 

1 54 54 63 1700 

1 63 63 67 262 

4 66 66 67 99 

25 67 67 67 71 

100 67 67 67 68 
iance оў estimate of сп from уату £, 


sed sampling var 
rrelation 


Table 3. /ncre& 
о within-section serial со 


due to presence 
mate from random data. 


Ratio of variance (percentage) to that for esti 


(unit = с) р= 01 р= 0:6 р = 0:95 
EE кенес ==" 
кенет a — Е == 
0 254 1818 
F 140 346 
145 


ow in the cases p < 0:6, the 
se. In the cases where the limit 
ably greater than 100% 


ficiency бё! be consider 
jue of unity in our calculations) or sm: zller. 
stimate of E(B.) 80 that this asymp- 


a 
gis; in the other cases it is more 


a . 

қыта it from above in the cases P 7 

is imum jump-analysis value of € being cho 
approached from above, the relative © 


when o i 
cis of the same order 
р that W, supplies а 
E 7 ium analy: 15; H h 
. However, the graphs are 


as Og 


to practical jump -B 
ble with that of Pao 

а compa ounds, namely, that as the variation 
fficient than any jump 


It is; 

18 " 

toti in precisely these cases 
с comparison has 2 rele 


abs 
со tract, since W, has & varianc 
nsistent with what we would expect 00 general 8 Десе 
a test based 0n riance 18 
25-2 


of 
&(t) approaches randomness; 
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" $ arkoff 
is. It should not, of course, be forgotten that even in cases where the acting ps a - 
s А » с. len. Eos 
pur unicum curve has a small parameter, the true curve may diverge from it fo 
seria 


i i si arry out, 
ter than the section length, rendering a test based on variance impossible to carry 
grea : s 
for the reasons given in $1. 


That jump analysis сап be more efficient than a test based on variance, ж. E i 
practicable, in cases where the parameter is large is not intuitively surprising; а i. 
parameter reflects smoothness, and discontinuities from section to section Би к 
creasingly obvious as this smoothness increases. This gain can be almost large enoug 
offset the loss due to the presence of serial correlation, id ding 
Tables 2 and 3; this occurs when the entry in Table 2 is almost as large as the correspon ies 
entry in Table 3. The information yielded by jump analysis on seriall y correlated data as 
fraction of that which would be yielded by variances of ran 


dom data with the pet 
болтан т 
persion is given roughly by dividing the former by the latter, and is evidently someti 
quite small. 


ison of 
as can be seen from comparisor 


Illustrative example 
The data in Table 4 below consist of 1 


" š : 3 been 
56 terms of an industrial timo series which has be 
arbitrarily broken up into k = 12 suc 


i ; ‘ ; m ach. 
cessive sections of n = 13 consecutive terms 6 t 
The analysis given is fuller than would be warranted in many practical applications, 


e e 
even so begs certain questions (e.g. testing the goodness of fit of the serial variation — ) 
which lie outside the scope of this paper and might reasonably form the subject of inquiry 


Table 4. Weekly determinations of percenta: 


ge of free carbon in coking 
coal as charged (origin = 46, 


unit = 0-1) 
DEM 
Section 

Term | 1 2 3 4 5 6 1 8 9 10 11 е 
140 | 71 | 67 | gi 39 | 46 | 58 | 35 78 46 24 ue 

| 2 | 27 | 47 | 60 | 49 | gj 05 | 51. 50 | 32 | зв | за | 32 
3 40 65 58 65 46 60 38 43 | 43 | оу 35 20 
4 45 44 | 58 59 53 62 57 54 аз | 26 35 Ы 
5 34 65 70 | 59 45 50 53 4l | 82 | 24 28 os 
6 | 42 | 53 | 48 | 5g | о me | 84 | as | ва | 38 | 44 | 26 
7 30 56 53 | 58 35 46 69 34 46 | 30 46 28 
>o a 0r уа ЗЕ 58 | 40 | 44 | 42 | зв | 2 
9 54 | 58 59 49 29 53 т 43 35 | 37 33 еқ 
10 | о | 96 | 66 | 45 | зт | № AER SE | ae ЕЕ 
| 66 | 72 | 72 | 45 | dg | m 43 | a5 | as | aa | 37 | 26 
12 | 57 | 86 | вв | ЕРГЕ 27 | б 35 39 | 28 | 29 
13 49 | 47 | 68 | 46 | 36 | 57 23 50 | 49 | og | 99 33 


Тһе first necessary step is to caleu 


sections for z = 1, о = 6. We have c. 
the bulk of the labour consisting of 
æ terms for the 156 successive terms. 


late and com 
aleulated ір for thes 
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In view of the remarks in the paragraph following that where (16) occurs, a choice of 
9 = 6 seems optimal, giving the value 30-1 for W,. The value of B, for x = 6 is 32-1, so that 


J, = In (32-1/30-1) = 0-0643. (49) 
From (2) the prima facie standard error of J, is 
(92/1) = 0-447, (50) 


so th ; i к ғыз 
еқ at J, is not significant оп the 5 % level. No further analysis is necessary, since any factor 
or - 23.9 1 ~ 2 s Р 
E Тө correct than 2-2 in (50) would certainly not be less than 2. In more critical cireum- 
star isi 5 i i i 
ances, however, the need for revision of (50) might be investigated along the following lines 
© T 


р (a) Scrutiny of the variogram 
For fitting a locally Markoff serial variation function to the data the least-squares method 
ын & Jowett, 1958) would be appropriate; however, the breaking ofthe series into short 
E = and the non-availability of d(15) prevent the tables in this paper from being used, 
de ақыны ына do not i ustify the labour of an ad hoc computation. However, as was 
Y Davies & Jowett it may fairly be assumed that the fitted function will reproduce 


Table 5. Values of W, for coal series 


a Odd sections Even sections АП sections 
1 61-9 87:8 74-9 
2 48:0 34:0 41-4 
3 51:1 31:4 41-2 
4 50-8 31-9 41-3 
5 35-1 32-5 33-8 
n 97-5 32-7 30-1 


| 
| 
| 


үу (which ің identical with W,), and it is both convenient and reasonable to obtain the 
Suman equation necessary for the estimation of the two Markoff parameters in (28) by 
ing W/W, to E(W;)/ E(W;) in Table 1. 
ence fr қ 
from Table 5 Est (1 — Est p) = 749, 

“nd the value of 0-402 for W;/W, suggests the value 0:33 for Est p (using inverse linear inter- 
Pol ation in the appropriate section of Table 1), so that Est ô = 112. 
Же. Variogram has been plotted in Fig. 1, this fitted curve being shown, The later serial 
: n. 10n statistics are very erratic, being based on only very tow comparisons, and their 
к. Tue from the fitted curve is probably of no consequence; the standard error of d(2) 
ie 11 of Davies & Jowett, 1958) is roughly 16 and соор of 26, though not. 
Vari meing in itself, taken along with the evidence of а (2) ana 4(5) suggests a serial 
fone curve such as that shown by the broken line as d dnd likely prima йы. 
о Owing the suggestion in the antepenultimate paragraph of $2, we extrapolate the Gen 

a value of 197 at r = 14, and find that А66(8) = — 19 for the broken curve as compared 


те Jump analysis 


ion i H i igible for 
: 12 for the Markoff curve; a fair working assumption is that Aș ô a wearer 
ee Mon Taking the average suggests values of Y^, in (22) of about 0-06, 
ре trifling. Hence the use of (2) is not contra-indicated. 


160 


Prima facie 


112(1 0337) 


60 


2 4 


6 8 10 12 14 
Lag 7 = 


Fig. 1. Variogram and fitted curves for free carbon series. 


(b) Assessment of mean Product of differences 


Using (20) we obtain the following series of values (rounded 


а: 
) for рді, 13) from the dat 


ЕНЕР l: T 
t plt) t p(t) t pit) 
Á 

13 149 65 =1 117 —35 | 

14 233 66 0 118 —91 

26 5 78 0 130 1 

27 78 79 -3 131 8 

39 — 3 91 1 143 5 

40 - 59 92 45 144 -18 

52 - 5 104 86 

53 —15 105 26 — | 


var (log В.) = ia. (i) _ 24 
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Suggesting a value slightly greater than that given in (50) for the standard error of J,. 
However, the variation in the figures is considerable, and there is no prima facie case for 
relying on this correction. 

16 is possible to make only a rough estimate of the standard error of 7, in view of the 
nature and shortness of the series. The serial variation curve for p,(f, 13) may be expected to 
be complicated in shape, in view of the many common values of x(t) involved іп neighbouring 
values of p(t), and its estimation from the data is quite impracticable. However, the series 
of means of the pairs of adjacent values is more tractable, a fair working assumption being 
that the serial variation curve flattens out by a lag of about 26, for which the successive 


means have no values of a(t) in common. 
The analysis of semi-squared differences (following Jowett, 1955a) between the eleven 


n х ; 
esulting means is as follows: 


Source | Sum of s.s.p.'s Number Mean s.s.p. 
АП 433,450 121 3,582 
Lag 0 0 11 0 
Lags +1 38,143 20 1,907 
Other lags 395,307 90 4,392 
Henco Est oz = (4392 — 3582) = 28. 


This can only be a very rough estimate, as will be obvious to any statistician who looks 
at the figures on which it is based; however, it suggests that the value of p could easily be 
fortuitous, and there are no compelling grounds for making the correction suggested above. 


We would like to thank Dr H. C. Hamaker, who first drew our attention to the need for 
* Solution of this problem; to Miss H. M. Davies and Miss D. Kirk, who have assisted us 
With the computation; and to Mr J. Hebden of the National Coal Board and to the Carbon- 


ization Branch of the Board, who kindly supplied the data. 
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THE RANDOM WALK [IN CONTINUOUS TIME] AND 
ITS APPLICATION TO THE THEORY OF QUEUES 


Bx С. В. HEATHCOTE axo J. E. MOYAL 


Australian National U niversity, Canberra 


SUMMARY. The random walk in continuous time has been considered by a number of authors 
for particular types of boundary conditions, 


random walk with one absorbing or one refl 
sequently by Bailey (1954). The 


y yields the solution for any other солан ате 
ese are of interest in the theory of queues with exporentia’ 
alk with reflecting barriers at О and № (84) is the single вагувт 
ossible customers is limited to № (Morse, 1958). By letting 
т single server queue problem solved previously in the papers py Lagar 
by Bailey referred to above and also by Clarke (1956), Champernowne (1956) ant 


queue problem where the number of p 
N — co we obtain the familia; 
mann & Reuter and 
Conolly (1958). 

The passage to a ' diffusio 
the well-known method of 
general process wher 


n’ process is also considered. 
images (Chandrasekhar, 
е the transition intensities А an 


The method given provides an alternative 36 
1943; Bartlett, 1956). In $5 we consider а more 
d д depend linearly on the particle's position in a 
bounded interval, and are constant outside this interval, This enables us to give the Laplace transform 
of the probability generating function for the N Server queue, a problem recently considered by 
Karlin & McGregor (1958). 


1. IxTRODUCTION 
A particle will be said to describe a r 


chances A, и per unit time of taking on 
axis. For prescribed boundary condit 


andom walk in continuous time if it has constant 
e unit step respectivel y to the right or left on шеше 
lons, such as absorbing or reflecting barriers at Pm 
of the particle’s position at time t after the start 9 


; )88 
d al axis (as сап be done without lc 
во that the initial position of the particle at ¢ = 


of the processes studied we seek Р. | (t). the probability of a tr 
We use the ‘backward’ equation (Bartlett, 1956; Feller, 1 


dP, „(0) 
“а = (А+), „+ AD, 


se th 


: "or each 
0 is k, an integer, and for ea 

"Pe ч ime t 
ansition from k to n in tin 
957) 


1-1) 

Aa) RP, a(t). | 

Тһе probability generating function G,(z,t) = УР, (t) 2" then satisfies 
n 

961, (2,1) > 

и = —96,(,0) Аб 


(12) 
к+ї(2, t) * AG, (e, t), 
where & = À+ y. Clearly 6,2,0) 


= zk, 


92,8) = LUG (s. 0] 
it follows from ( 1-2) that 
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The system consisting of the backward equation, together with the side conditions, has 


a unique ‘honest’ solution (‘honest’ in the sense that УР, (0) = 1, or equivalently 
n 


6,1,0) = 1, 9(1,8) = s-1); in fact ‘honesty’ implies uniqueness and conversely. This is 
Well known to be the case for (151) without side conditions (cf. Bartlett, Feller, (ос. cit.). 
With side conditions it can be proved to follow from the general theory of discontinuous 
Markov processes given in Moyal (1957) by transforming the system into an equivalent 
Integral equation of the type considered in that paper; the explicit solution for the single 
absorbing boundary case is given as an example of the general theory in Moyal (1959). 
l'he method of obtaining explicit solutions developed in the present paper is however much 
less laborious, especially for the more complicated side conditions, but we still rely on the 
Seneral theory for the fact that the ‘honesty’ of each solution thus obtained implies its 
uniqueness, 

Asymptotic equilibrium distributions are known to exist for this type of process. Their 
explicit expression is most easily obtained directly from the Laplace transform solution of 
(1-2) by an extension of Abel’s Theorem (Widder, 1946, ch. v), which yields 

lim sg,(z. s) = lim 6,2,4). (isd) 
в->+0 to 
Whenever the limit on the right-hand side exists. It is in fact sufficient for (1:4) to be true 
that the right-hand side exist as a Cesaro limit. 

There ің another type of asymptotic behaviour of the solutions which is of interest. 

Namely, the passage to a ‘diffusion’ process. We effect this passage as follows: Let 


а = Аъи = о], А-и = а], x=nh, y= kh. (1:5) 


We assume that each step is of magnitude h, that o and a are constants, and we look for the 
distribution of the displacement 2 conditional on y as h = 0. Let 


Г«|»0 = lim jj, m(t). (1-6) 


Carrying out this limiting procedure in (1-1) one arrives formally at the ‘backward’ diffusion 


e : н 
Wation with drift term 


д | Bd @ x |y; t). E 
5 «|ш0-із gi * yj |y; t) (1-7) 
It win be shown later that this passage to the limit in the solutions of the random walk 
Problems yields the solutions of (1-7) with the appropriate boundary conditions. 


г THE RANDOM WALK PROBLEM 
2. THE GENERAL SOLUTION OF THE RAND аы 


To Obtain the general solution of the difference equation (1:3) js need а particular solution 
and the Solution of the homogeneous equation obtained by omitting the term zk. Since the 
sition probabilities clearly remain invariant under translations of the origin it follows 


‘at Gaz, t) = 2kG,(z, t) and hence 


ә 
ай) = ее) (k= 0, 1, E2 e. 
COM" 
‘bstituting in (1-3) we have 
афо А2271) = (5,8), 


9%(2,8) = ( 


9 Random walk ата, its application to the theory of queues 
40 
so that à particular solution of (1-3) is a 
gi (2.5) = zd (z, s). 
Substituting g, = w* in the homogeneous equation we obtain T 
о қ 
u*(s а — Au — uu) = 0, 


the two solutions of which are the zeros of 9(2, s), namely 


uis) = (2A) [s +æ — J((5 +a)? — 4A] | (22) 
апа uals) = (2A) [s +æ + J ((s +a)? — 4g]. 
The general solution of (1-2) is therefore Ж" 
4,2,8) = z*ó(z, s) + A(z, в) ий (8) + B(z, s) и в). @ 


be 
і і = А must 
We require that 0, — 0 as s — co, so, since му > 0 and u, — co as s — О, 


ы А 1у one 
bounded, while B must tend to 0 with 1/5 faster than uzt. For problems involving опу 
boundary we set B — 0 so that (2-4) reduces to 


(2-5) 
0,2,8) = z*o(z, s) + A(z, s) ий в), 


ints 

where A is determined by the boundary condition. Boundary conditions at two point 

determine both A and B. ‚йа 

If no boundary conditions are imposed we have the unrestricted random walk, aly 
solution of which is (2-1). The Laplace transforms of the probabilities P, (t) may be гей 


я : : ; 4 гей оп 
obtained by expanding (2-1) in a Laurent series for values of z in an annulus centred i 
origin, which contains the circle |z| 


m ‘ it 38 
= 1 for all s whose real part is positive. Clearly 
sufficient to do this for Ё 


= 0. We have then 


о = 2.6) 
Go(% 8) = A7! (ug uy) | X (игу + M hay] . | 
T- r=1 
ms 
We note that g,(1, з) = 1/8, so that the solution is unique. The inverse Laplace transfor 
may be found directly from tables (Erdelyi, 1954) to give for the probabilities 


1) 
P,,,(t) = Breed (on AU) (r= 0, +1, +2, сыз); e 


out 
ction of the first kind. Through " 


Gy.) = гехр[ 
which may be expanded to verify 


the results Obtained above, 
If now we substitute from (1-5) and set z = gion zk 


(9:8) 
-ia z= peo), 


= e in (2-4), then 
w(0,s,y) = lim 90е", в) 
һ->0 


= eh (54 1020? —ia0) + a(0, 8)exp | 


m. а ay 
o (52-9) 


төле Дені, 97 
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where a(0, s) = lim A(e™, в) and b(0, s) = lim B(e#, s) and w(0, s, y) is the Laplace-Fourier 
= h—0 


transform of the solution of (1-7), i.e. 


(0, 8,9) = І edt | et f(a | y, t) ах. (9-10) 
0 J- 
In the unrestricted case a = b = 0 and 
ы» EN. 211-3 ox |(r-y-—at 
f(a |y; t) = (270%) вех | mo mah (9-11) 


3. THE RANDOM WALK WITH ABSORBING BARRIERS 
If the states O and N, say, are absorbing barriers, the process stops whenever the particle 
reaches either О or М so the boundary conditions are 
ф = 18, gy = “7/5. (3-1) 
The values of the constants А and В appropriate to this process are found by substituting 
the general solution (2-4) in (3-1); with these values of A and B (2-4) becomes 
gy = (zk — ul) фз + (871 — 9) (2X — ug) (u$ — ui) (ud — ui) (3:2) 
k 
= (uf — u7) [n wg) (ий ш) + zz —u,)]^ (u$ — u$) (u3 7 — и) 
r= 
N-1 "E т Т € eee 
ar IS раа а) (ud 7 D (08 r—uj 7) zNs 8-9). (3:3) 


x (Uz Ua)" 
г=к+1 

Inverting the Laplace transforms gives for the probabilities 

Fi, o(t) = 6-8 5 pe е- (ӘУ + К) loge — [j-)0N-kH Lagann- dr, (3:4) 
1=0/0 

Dt) = fr-* x е {Toga n—ker t Туун Р (т = 1,2,..., Е), (8:5) 
j-0 


Pit) = pr S e аума ТК Бук чча) 

xs (7 =k+1Lk+2,-.,N-1), (3-6) 
re? (9+0) N -k]fojzoN-k 7 [027+ 1) 7 + 0] Дерма). (877) 
e symmetrical in A, p and k, N —k as 


t conditional on initial position ķ is 


В. x(t) = ВУ“ Y à 
j=0J0 


The absorption probabilities PaO 
expected. The expected value of the 
o ft 
PNT ET |, 
(PIN + k) pom, + pN-([(j o 1)-N +k] 
-[Gj- 0N-H Тео} dT. (3-8) 
d using (1-4) is 


and P, v(t) are 
position 7 at time 


-ат 


-Ці-т)е 


x (f-*(2( +1) N — E agen vt 

x Toj+DN+k 
. tribution obtai 

The generating function of the stationary distribution obtaine 


N- 
lim G,(z,t) = lim sgy(2 5) = Tit? Ув | Е 
to s>+0 JAE — (uA А (ИГА) if Афи, 
em чае” C А 


1— KIN +210 


(3:9) 
if дед 
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which yields for the expectation E(r | Ё, со) 


ма (ИГА) (шут if A+ py, (3-10) 
E(r |, о) = А E йек. 

Тһе boundary value problem discussed above is the continuous time analogue r 2 
classical ‘gambler’s ruin’ problem, in which two gamblers with initial capital Ё and 1 it 
respectively compete. As may be deduced from general arguments the asymptotic resu 
we derive are identical with those of Feller (1957, chapter xtv). The probabilities of кеміне 
ruin are т, and y, respectively given by (3-9). Since z,-- y, = 1 ultimate absorption а 
either one of the boundaries is certain, and the expected lapse of time before this occurs 


BUIE = | “КРЕДО + Py, у 


= [Ez 50 9 C - (gar if Аял, 


(3-11) 
БОТ — k) (24) if ЛЕД. 


The solution of the problem with a single absorbing barrier, say at the origin, may either 
be found from the above by letting N + со or directly using (2-5). Taking the limit as N — © 
in (3-2) we have for the generating function 


бк = (2 — ub) + stu. (3-12) 
The last term in the right-hand side of (3-2) tends to 0 as № 


В and 
> co since ш < |z| < ап 
щ < l, ù > 1 for Res > 0. The probabilities in this case are 


t 
Pt) = up f тле ат, 
0 


(3:13) 
Proll) = Phe La) (1,23...) #14) 
with expectation 
t 
E(r|k,t) = k+(A-p)t-(a =) a= | T\(t—7) e* I, дт. (9:15) 
0 
These three equations yields incidentally the relations 
2 t , 
ХИ, ь- Lm pretr | 771 galt—r) Idr (3:16) 
из 0 
and, by putting А — y, 
„У т, (ай) — 1, (о4)) = веч (k=1,9, Ж (8-17) 
Asymptotie results are well known (see Feller, loc, cit.) and are 
P, (0) = Тыш if Ady. (3-18) 
1 И Ax, 
В, (00) =0 (r= ыў (319) 
үе И Ag 
E(r|k,oo) = Ih if As Р (3:20) 


0 if Az, 
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Probabili iti i i 
Ри ind densities of first passage times through a barrier are given by differentiating 
а absorption probability. For example, in the single barrier case, the rob- 
y density of the first passage time through the origin is, from (3-13) | — 
pi (D) dt = ЕВ--1е— I, dt. 


Passas ль ЕЕ 
assage to the ‘diffusion process’ limit in the case of a single absorbing barrier at O yields 


10(0, s, у) = ei?" (s —ial + 3220?) + [1/5 — (S5— iad + 10?0?)] exp — Е +? Je + =) 
ee o?) f 
(3-22) 


(3-21) 


3-22) is the Laplace-Fourier transform of the 


Ф ; ; 
he second term in the right-hand side of ( 
entiating with respect to ¢ we obtain the 


е кк : : à 
ашу of absorption. Inverting and differ 
Probability density p of the first passage time at the origin 


HEN SE 
pt. y)dt = ағы (нана а. (3:23) 
Note that o _fl if a<0, 

Js pt. y) dt = fa if az ^ (3-24) 


L ; 
et fec, t) be the density obtained by setting y = 0 in (2-11); then the probability density 


1n the absence of absorption is 
t 
fix last) = fiie t- | fist otn nar. 
ymmetric form of the transform w(0, s, y) is obtained 
translating the origin of x and y to 42, 


(3-25) 


| Set Nh = L, a constant. A more 8 
п the case of two absorbing barriers by 
= 2 
w(0, s, y) = yr eio + = pear (езе зіп oy +11) 
sinh oL 
(3-26) 
& 1 20 = | d = 1 Ба 
(s—ia0 + 300) anco 7g J V о 
densities at 1L and —iL are obtained by inverse trans- 
+101 jn the right-hand side of (3:26). One finds 


— еб sinh w(y — $L)], 


wher 
e / = 
y= 


T А 
ац first passage time probability 
"mation of the terms involving € 


exp аи] 2 nth) 0—0) 
2n43) L- 
P4(L y) dt = JOnt) «25 » | ! 


1 
хехр Ес [(2® + 12-0] |4, 
(3:27) 


а 
те Ее AL) + iat | E 
exp PAAR, )*? } т [(2n 4-3) 191 
p (t, y)dt = тов) dies 
1 2 
xexp Еш ША >. 
е distributions at 4L and — 1L. The 


the first passage tim itions al 
of wirds has an expression similar to (3:23), 


7)[p,(r. 3) +P 1] dr. 


Wh 
тке P+ and р_ аге respectively 
ability density in the absence 


t 28 
felun oie nn |і ш 
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4. THE RANDOM WALK WITH REFLECTING BARRIERS 
We now take the barriers at О and N to be reflecting ones, in which case the boundary 
conditions satisfied by (2:4) are | = 
(8+ А) 90 = 0151, (8+) Gy = Hya +2". 


Solving for the constants A and В as in $3 we obtain the solution " 
Б +2 

б = PEX peN а) + 2k + Y, — 271), | 

where Хк = А (wht — ul) — (u, us) (ul ий) (ud — a1), 


Y, = Аз (ииз) (qug 17k — ur 1-1) — (um ау) (ий — ий). 
(4:2) may be written as 


= BEE (1-8) и) e OX G24 (1-2) a} 3 
> d+ (1-24) uk4(1-w)7} as Now, (б ) 
: Е е 
(4-3) is thus the solution for a single reflecting barrier at the origin, a result which may b 
checked by direct methods. . 
Expanding (4-2) and inverting the coefficient of 27 yields for the probabilities: 


ж 
B, (t) = fr-ke- D, арғы “у, олан + 1, 30-0 (-)— kr) 


о tt 
А Y, | 67 Ceasers area anara 2 ума ас 
j=0J 0 
- T 4-4) 
 лхеумена 7 28 Djs cassa + В Mana dT (r= 0,1,2,... N) ( 
In order to lighten the algebra, we give the expected value for the special ease k = 0 only: 


o (ч 
Bir]. = а-ы ay f =r) e= Фот) йт, 


(4-5) 
where Фет) = -%(2У-1) Lbjya-[2( 41) N 4 1] Byson} 
+54020) +1) Мух М) (27+ 1) + Плох 
-[Gj-- 1) N -1] Tossa} р 
Asymptotic results are most easily obtained by applying (1-4) to (4-2). Thus the generating 
function of the stationary distribution is 
Qe (quay) io 
lim 802,8) = lim G,(z,t) = | 02-5) (uae Н ЛФ, (4-6) 
5+0 t>o (1-гуы) А 
| (1-2) (WE Ж Aeg 
and hence (A=) (u]A)N- 
ма OMA UT 
lim Р, (0) = M1 (шуну Н Аж, (£7) 
4->ю 
МОТ +1) if ЛЕЙ. 
The expectation of ғ in the stationary distribution is 
IN [1 — (way) 
E(r,oo) = x- ІА ] (fA if Agp, (48) 


ыу if ЛЕД. 
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In the context of queueing theory the random walk considered above is the single server 
queue problem with the restriction that the size of the queue is < №. In this case new arrivals 
do not join the queue if there are already № — 1 ‘customers’ waiting for service. The prob- 
ability P; (t) given by (4-4) is then the probability of having r 'customers' in the system at 
time ¢ (ineluding the one being served) conditional on k ‘customers’ at time zero. We list 
below the La place transform of some formulae of interest. Inversion of these results in- 


volves simple but tedious calculations which we do not carry out. 
The expected length of the waiting line L at time ¢, conditional on initial length &— 1 is 


E(L|k—1,t) = E(r | 5.0 +B, olt)-1 


ала has the Laplace transform 
fuk + 0 и) Xy + uz (us — 1) Y) 


ZUR |k-1,0] = (А) sts (E 34 Y,7 Xi) Ааш) 

(4-9) 
and, by (1-4), 
E(L|k—1,0) = {N-{1 — (uJA)) 1 —/A)-H ( — (Аун ү, 


The distribution of the duration of a busy period, p;(/), is obtained by considering the 
random walk with an absorbing barrier at the origin; i.e. with the boundary conditions 


go = 1/8, (8+) 0х = Му +2. (4-10) 
Proceeding as before the solution of (2-4) with these boundary conditions is 
© * 4 
gy = sV + Ф — 2%) W+ zk 1), (4-11) 


Where V, = (ufus - 1) + иуи (1 —24)) (ud (uy — 1) + ut (1 —4)) 


and W, = (uk — ut) fug (us — 1) + uj (1—24))71. 
ill be absorbed at the origin before time 2, conditional on 
ability density, р,(), of the duration of a busy 


(4-12) 


The probability that the particle w 
Initial state &, is .7-[V;/s]. Thus the prob 
Period has the Laplace transform Фри = ҮМ). 
have the familiar single server queue 


E i i N ->œ and we 
еа анау ЕП The appropriate generating function 


Problem in which no limit is placed on queue length. 
18 given by (4-3), and the transition probabilities by 
| x 26-а + Tenes] dr) (r = 0,1,2,...). 
В. (0) = Ве, al e {В iur ^ k+r+l 2 
iii ( id 0 (4-13) 
iven by Ledermann & Reuter (1954, p. 366), obtained 
ted value оЁ is 


t 
e» (k+1 7 em% т. (4-14) 


(4-13) is identical with the result g 
Y the use of spectral theory*. Тһе expec 


B(r|k,t) = ke cite SPE 


lowing authors: Bailey ( 1954), by solving the forward 
arke (1956), by solving an integral equation of 
ility argument; and Conolly (1958), by an 
ast three of these references. 


ned by the foll 
tion technique; Cl е 
irect proba! 
olte: А wne (1956), by а direc 
ноа орото ps are indebted to the referee for the 1 


8 * This result has also been obtai 
quationg using a generating fune 
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, PU .14). The 
ected length of the waiting line can be written down from (4 18) and (8 : ae 
dii cam density of the duration of a busy period. p}(t), is given by differentiating (+ i 
pro 4-15 
thus pi (0) dt = КВА te T. dt. ( ; 
: : Е кек 1951). 
The ergodic properties of this process have been investigated in detail by der at eed 
In order to obtain the ‘diffusion process’ limit, set again Nh = L and translate "i ne 
of x and y to 3L. The reflecting boundary condition is seen to correspond to efley (28) 
+ &L, and one finds the transform w(0, s, y) of f(x | y, t) (using the same notation as in (: 
М -ау/о?2 А Y 
10у eoi? {e940 А cosh w(y + 1L) + e+ cosh w(y — 36). 
2o sinh wL d 


(4-16) 


w(0,s, y) = ету 


5. QUEUES WITH N SERVERS . 
қ 5 zeller. 1957 15) 18 

The exponential arrival and service time queue with М servers (Feller, 1957, p. 4 ) 
defined in our notation by the equations 


" 1) 
(+++) дь = Agua иа (k= 0,1,...,.N), ( 


53) 
(5A МА) gr = Айыл + д +2 (em М,М-1,...). ( 


t 


д 


А 4ейу the 
Before giving the solution 9x(2,8) of the system (5-1) and (5-2) we mention briefly t 


а . inear 
generalization of (1-3) to the case where the coefficients of the difference equation are line 
functions of Б; 


5:3) 
{kla +b) +s+ a+ bo} gy = (ka, +a) Irsa + (kbi + bo) дра +2". (5 


The homogeneous equation obtained from (5-3) is the well-known hypergeometric ie 
ence equation, and its solutions have been studied in detail by several authors (e.g. Batche . 
der, 1927). In fact there are twenty-four such solutions corresponding to the twenty for" 
solutions of the hypergeometric differential equation (Batchelder, 1927, p. 101), and ie 
may choose two of these appropriate to our problem, say /(в) and h,(s). A particular 80 и 

tion of the non-homogeneous equation may be found by taking the Laplace transform 0 
the generating function G x (2, t) of the original birth and death process. (7, (z, t) can be foun 


readily by standard methods and has the convoluted binomial form 


ацал) = ahb Цель — 1) + (а, Буви), 
4,—b, 


x MESS ae, (54) 

The general solution of (5:3) is then LE 
.Б 
9к(2,8) = (а, 5) + A(z, 8) fils) + Biz, 8) №, (8), ы 
where үа, s) = 716,2, 01. As before A and В шау be chosen to satisfy given boundary 
conditions, қ 


There are several interesting variant For example, a linear birth and death 
: е , a 
process with a = 0 = В, and reflecting barriers at ] — №, №; № > М, could be used 48 
. H Р 2-7» 2 1: о 
linesrised оде) А ss (Kendall, 1949). The essential feature" 
Kendall’s logistic model is that the coefficients а, quadratic in k, so that the states № ® 
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Since (5-1) and (5-2) are special cases of (5-3) their solutions are, respectively, 
42.8) = V s) А(а,ө)”ө) (k = 0, LE. N), (5-6) 
gules 8) = 242,8) + Bl s)ul(s) (bo N,N +1,...). (5-7) 
(s) and ó(z, s) are the same as in previous paragraphs with j/N substituted for д; 


(2,8) = 9 (1—( — =) e=] exp[— (p) (1—2) (1 7677]; 


7 is 
= ipaa Y (^) ес? рыи: ваў: (Миа 5 
PU" ifi ; [и rj 1; fg) (1—2)). (5:8) 
1 
vy(s) = | егдзін 2*(1 =w) dx 
0 
= B(k4 1, ви) (k+ 1; Ё+1 +; = Аи), (5-9) 


Iu Б(т, п) and F (m; n; x) are the Beta and confluent hypergeometric functions, respec- 
ring The boundary at k = 0 being a natural one, A and B are determined by the equation 
Ог Jy(z, s), yielding the solution 
2N-1(z — а-ә) 
Ё TN ы - и), (р-0,1,..., М), 


Uy — бу 


2,2. 5) =. 
i ммр ву) BEAN [ess diss Vx 
up Цех ххх), (Lo NONAL,....) 


А 
m Uy — UUN-1 


^ 


(5-10) 
The Laplace transform of the expected number of ‘customers’ in the system at time t, 
Blven % initially, is 
ФЕ (в |%,t)] = 


нЕ А (иМ-А) ,__ 1 а =) (N — A[p) +u). 
Alfa + ү; a ros 1) A(uw,— 1) n ; 
(k = 0, s LN). 
5-11) 
"N-17?N aS v—GO — Nu)v, 
А- A=) (9x3 — *? ) 0 1-A/p)ey- (2 JH) Va 
tjs R0 80 и “4 UM ed 


(k= N.N way 
a Ry = rly — ts yi] 7 
S, = uk-N uy- ttr]. 
> A, may be found directly from (5-10) 
the limit are considerably simplified by 
ent hypergeometric functions (Sneddon, 


(5-12) 


which exists for N 
lved in taking 
for conflu 
Fas ys o) caufis 7: =). 
—Ух and Vy — UYN- and expanding as a 


нь : 

vd Stationary distribution, 

"i. (1-4). The calculations invo 
Sing the following recurrence relation 


1956) 
(zly) (ағыу а) - 


Using (5-12) repeatedly in the terms № V xi 


Power series, we have 
lim sg,(2, $) aln 6,2,0 

s—> +0 ->o 
NAMM ey УЛ; uN >A, | 
á | X e^ А (5-13) 


Biom. 46 


26 
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NS Ale, $ Quy ] 

дени қ-а +5 гух 

(5:13) agrees with the result given by Feller (1957, p. 415). 


e 1 к е. 
The barrier at th origin of the System (5:1) and (5-2) is a natural reflecting on 
imposing an absorbing barrier atthe origin 


(5:14) 
fo = 1/8 


) гі density 
and solving (5-1), (5-2), for k > 0, and (5-14) for k = 0, we can find the probability dens 
pi 0), of the first passage time through the origin, i.e. the de 
period. Say this solution is Ix = gi(2,s), k > 1. Then 


48,4) 
dt 


nsity of the duration of a busy 


5:15 
Pilt) = = 4" (sg(0, в). (5:15) 


> неп pple) 
[Note that | ры) dt = 1 if and only Ним > А.] The moments of the distribution px 
0 


1 lace 
may be found directly from (5-15) using the following elementary properties of the Lap 
transform: If .Z(f(t)) = g(s) then 


q" 
LG} = (~ 1p C99 


ds? 


t 

s Дт) ar| = s-lg(s). | 
0 

Using (5-16) and (1-4) the expected value of t" 


(5: 16) 
and 


conditional on k, H(t” | ©), is 
a [^ а" (5:17) 
Be" | = Hm [пруда = (сау E oso, S). 
1->ө,/0 ds” 
The solution of this absorbing barrier problem foll 
modification that we require the general solution о 


stants. It is easily verified that a 
from (5-1) is 


HR 


8-0 

ich the 
ows the same lines as before, with ез 
f (5-1), k > 0, with two arbitrary ed 
second solution of the homogeneous equation орва 


k k 5.18 
wale) = 2 (шута - 
The general solution of (5-1), k = 1, 2, ..., N, is then 
.19 
9,(2,8) = Vries 8) + A(z, в) vels) + C(z, в) w, (s), m | 


М b- 
where у, and % are given by (5-8) and (5-9) respectively, С can be eliminated by 8" 
stituting (5-19) in (5-14) to give 


Ir = (и, — Wr Yo)} (вю) 
Elimination of the constants A a; 
give the expression for the requir 


.20) 
+4 (из, — wi a) wg. e 


we 
) and (5-6) gives the solution g;(2. 5): 
8) fork > N only: 


NVo) (шу + у, — SU. 10) 


nd B from (5-20 
ed quantity g;.(0. 
(шоор 


900, 3) = = (Wty 100100) (Wy + St ry — 80 о) > (52 ч 
зш (шу — 00) =u (w °v-1— Wy_1%)} 
where the function (=, в) appearing in (5-21) has been evaluated atz = 0. 
Application of (5-17 


) to (5-21) yields the 


k > М. Results for 1 <k<N may of cou 


44 fot 
moments of the duration of a busy period 
tse be obtained by the same method. 
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NOTE ON THE COMPARISON OF SEVERAL REALIZATIONS 
OF A MARKOFF CHAIN 


By J. H. DARWIN 


Department of Scientific and Industrial Research, Wellington, New Zealand 


1. INTRODUCTION AND SUMMARY 
Consider a probability chain defined by X(t) where t takes in succession the values tf к 
where ¢;,,—t; = т,т > 0. Suppose the chain can be in any of s states at any time ! 5 M 
describe the event ‘the chain is in state j when ¢ = t; by saying X(;) = j. Then the cha 
is called u-dependent if Prob (X (4) = Ј) is a function of the и values 


X(;—r) X(t—27), 


We suppose this function is independent of i. 
А 


es X(l— ur). 


great deal of research into u-dependent chains of this kind has been carried out m 
recent years. Bartlett (1951) discussed the asymptotic distribution theory for function? 
of long realizations from such chains, that is for realizations consisting of X(l,), .... X(ty) 
when N is large. Following this preliminary work, writers such as Hoel (1954), Whittle 
(1955), Good (1955), Anderson & Goodman (1957) and Goodman (1958) have developed 
asymptotic tests of significance which, for instance, compare the maximum likelihood 
(m.l) estimates of the probability (X(t;-- zr) = j | any set X (t; + (p — 1)7), ..., X(¢;)} with а 
given set of values of these probabilities, which examine the length a of dependence of the 
chain, which are suitable when s is large, etc. 

In this note we discuss а one-dependent or Markoff chain with s states. We suppose 
(X() = k| X(;—7) =j} = Ру. We shall refer to pj, as a transition probability. We suppose 
also there is a unique occupation probability Р, by which is meant P(X(ty) = j) tends to 


Р, аз N tends to infinity. We suppose P; and ру, exist for all jand k. 
We develop: 


(a) A test of the equality of r sets of transition probabilities Pir This test follows auto- 
matically from Bartlett’s work. f 

(b) A test of the equality of r sets of transition probabilities Фи Separated from a test 0 
the equality of r sets Pg (1— 


t . Dj). j + k. This division of the test (a) is useful when 7 is small. 
in that it then throws light on the reasons for the differences in occupation times of the 
8 states from set to set. 

(c) A test of the equality of r sets of occu 
T is not small. The test criterion is difficu 
modern electronic computers. We show 
assumption of independence of X (t;) and 


parison is being made of the occupation 


pation probabilities Р,. This is of interest ho» 
It to calculate on а, desk machine, but not on 1108 
» When s = 2, the direction of inadequacy of the 
X(t;+7) when X (t;) is a Markoff chain, and a c0?" 
numbers for two long realizations. 


Bartlett (1951) showed that: 
0) та, the number of values of j for which X(t) = J. X(t;+7) = k or, say, the number of 
transitions from state j to state k is for a lon i ‚вау, Е 


E realization approximately normally и 
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tributed with mean NP Pjr and variance NP p(l = р). where N is the length of the 
realization: cov (nj. mg) is — NP; PirPa- We suppose all P, are greater than 0: 
(ii) if n jo = М ту. ny is for large N distributed approximately normally with mean 
Е 
NP, and variance proportional to N, so that т) [CN P) tends to one in probability; 

(iii) the maximum likelihood estimate of Рум ^ j[n;. is for large N uncorrelated with 
ШИШ if j + 1. 

It follows from (1) that for large № the s values т, k= 1,...,5 have а multinomial 
distribution with index NP, and parameters ру. k = 1,...,5; from (iii) that these s multi- 
nomials are independent of each other; and from (ii) and a theorem of Cramér's (1946, 
P. 254) on the replacement of parameters by quantities which tend to them in probability, 
that in the multinomial for the s values п. k = l, ...,8 we may replace № Б. by nj, 

We consider testing the hypothesis that ғ sets of values ру. are the same, that is that Pir 
where] = 1 r, is independent of [| for every combination j, k. The data are > long realiza- 
tions one from each of the respective chains. We are on very familiar ground because the 
Preceding paragraph implies that the test must be related to a combination of tests of 

E i e е " а тру ер . * P 
Srxgs contingency tables, each table corresponding to a value j. The likelihood ratio criterion 
© v < Dy VA 
т.с.) to test the hypothesis is 
Е S 
AEn log, 71,4 — Xn; 1106, n; ,— Ха log, nj, + Enj, log, m;..]. (1) 
In this and later formulae a dot in place of a suffix means summation has been carried out 
Over Е Mete п). = NX тц. The remaining summation signs 
Ver the replaced variable. Thus, for example, nj., зм 5 М g 


indicate that summation is to be carried out over all possible values of the remaining suffix 
Variables: n n_,, with equality for all j, if the last state is the same as the first. 
"29. мыт; p". mal j 4 Bref nw Ne 
It is easily proved. by a standard expansion of (jm — nj. Pj) A; that to first order in 
i Vip m түбі iter ; 
10 7, this criterion has the same value as the sum of s y?-type criteria for s rx 8 con- 
i $ : 5 рде х re is that of y? with s(r — 1) (s — 
gency tables. The distribution of the criterion (1) therefore is t X ( )(s— 1) 


deer, : 
Brees of freedom, 


‹ 7 N PROBABILITIES FOR SMALL 7 
3. THE COMPARISON OF TRANSITIO 


Suppose the chain is the result of observation of a process defined in continuous time. We 
NOW take the parameter t of $1 to refer to time. If 7 is decreased M ME = the 
°ngth of dependence of the resulting chain is increased or ore! ype К = pan e 

"appears. We consider only chains which are still Markoff chains for such low values of 7 


that itis reasonable to suppose no change of state has been missed—that is that there hag 
asonable to s , 
96h at moc 2 ween times t; and t; T. | 
Most one change of state s i isshown by theexample in educational theory that 
racti А 


hat su қ 

ch a case may occur in p defi GENE Ve | 

К » г). An educator defines а number of в ates in 
Ompteq the work of this paper (see $5 below ). 


В i il asking a question, etc. i 
ch a cl ' her giving formal instruction, pupil asse 4:4 ‚ ete. This 
ass may be: teacher giving a, even for low т. If 7 is smaller than the average 
an, ev 


ба, 1 1 "К гі Г + ж 
(ds to a chain, approximately Markovi hain stays for the shortest time it is unlikely 


t unt of time spent in the state in which the c 
is any change of state will have been missed. 


then becomes possible to make а simple interpr 
T val 


etation of comparisons between sets 


ues р. үү қ 
ыы” 4. = X(t;- (n—1)7); X(t, m7) = ka| X(t) = by} 
i n үле 1 


=j = X(t,+2r) =.. = (1— Pigs) Pins PH A= Pip) ә. (2) 
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- 1 p T Г 1 iginally sidered, and 
1 ө j сіп the Markoff chain origina lly CODSIC 

Chen 1 lays the role of sky J + К, int Я 

I ( Dj) ji P o 


e p rs tor t i I ity v,,is a con- 
hat chain. The quantity v;; 18 

y se of v, amounts to a change of aramete f fo tha e m 2% ба wed 

Е iti = al ro batis the probabilit. that if the chain has moved from sti e. s e 

ditional p у, у 


arvati 0166 it 
ili i si j for n— 1 further observations 
bability that the chain stays in state j a 
ume rmi ra pe Е Dj. This geometric distribution can then be grece adm 
rea 22 à . " ati 
ram of time spent in state j. [6 corresponds to an exponential dis 
1S 


; 3 debi i tinuous 
xp ( — At) А; Е of time spent in state j for the corresponding process defined in con 
es BM) rj 
time. 


T i ion time of 
It is commonly desired to compare the occupation times (by the анаа ыш и 
state j we mean the number of times ¢ the state Jj was observed) for two or more long 


tions. For chains of independent observations this com 


i i inary 
parison is made by an ord 
contingency 


m special 
table analysis. For values of7 not small enough to enable us to make the sp 
assumptions of this section, the nearest equivalent is 


ion 
a test comparing values of occupat? 
probabilities В. We discuss such a test in 


4 : the 
$4. When the assumptions of this section —- ia 
sev of t 
test of $4 and the test of $2 can be supplemented by a separate test of the ^s ind pn 
Sets руу. Such a test will be useful for instance when two sets of occupation times е Mem 
equal, but for one realization there has been a rapid fluctuation from state to state, 


Е $ в state. 
for the other there has been а slower rate of change with longer periods spent in each 
The likelihood ratio criterion to test the hypo 


‚ equal 
thesis that r sets Ән, l= 1, ..., r, are 64 
regardless of the values of the sets Руд is 
У ты log, тн X(n; ,— Na) log, (nj ,— уу) 


(3) 
7E у. log, nj, + (n; ту )log, (лу. т]: 

1%Е 
This is of the same form as ( 


1) with n; | 
to suppose the Ты k =1,. 


e 
— "уд replacing n, ү. Tn considering (1) we were «^ 
-ә8 had a multinomial distribution with index т; | for Jange К 
If now we consider Тл fixed in this multinomial distribution the conditional о жении 
the niu, j +k, for large № is again multinomial with index 2j.4—Njq and parame joy 
Фум/(1— Фуд). It follows as in $2 by analogy with the analysis of an их (s— 1) ана P 
table that the distribution of (3) for large n; , — "уд із x? with s(r— 1) (s— 2) degrees of à " 
dom. Since this is independent of the 7,4 — пд provided they are large, it is the ge" ha 
asymptotic distribution of (3). These degrees of freedom virtually follow from the fact * 
the s(s—1) values Pyal(l—pyq), j + k, have one linear restriction 

The likelihood ratio 


for each value of j. ual 
criterion to test the hypothesis that r sets Pi L=1,...,7, are ed 
regardless of the value of the Sets ®у is 
2[2(5,.— пул) log, (n; у it) + тор, пул — Yn, | log, т; 1 4) 
=n. —n5 log, ( i 


LM 773.) = En; log, n, + Хт, log,n;..- 
For large т; this latter criterion i i 
since for given LR Nyy i 
tribution being indep 
criteria (3) and (4) 


ples; 
X? criteria from s r x 2 ta dis- 


ations of т, 
) In the a 


ns. These latter criteria annoyin: 
accurately possess this additivity. 


z 
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For completeness we give the test of the equality of sets Adi Ооо нана 
process when 7^; is the amount of time the Ith realization passes in state j. The likelihood 


ratio criterion is 
D y “ 
?|Y(n, оа) low, (n; ,— nya) - Sly., =.) log, (n; — 15.) 
= (у ,— njn) log, Ту + (n; — n5; ) log, 7, 1. (5) 


This is, for large п; , — n, distributed as y? with s(r— 1) degrees of freedom. 


4. THE COMPARISON OF 7 SETS OF OCCUPATION PROBABILITIES 


If for general Markoff chains a test of the equality of r sets pjj, 1 = 1, ....7, had shown no 
ver-all significant differences between the sets it might be considered illogical to test 
differences between occupation probabilities P, since these can be expressed in terms of 
the Рум. It is, however, conceivable that the sets pj, may differ on the whole so slightly that 
the criterion (1) is found to be not significant, but that the particular functions of them 
represented by the P, may show important differences from one chain to another. 
“cordingly, we develop an asymptotic test of the differences between two sets of occupation 
Probabilities Р, starting from the fact (Bartlett, 1951) that thea; are normally distributed 
with means № P. for one realization. We then give the obvious extension to r sets Py. 
We begin by айыр the covariance matrix of ће я; for a single long realization of length 
from а chain with transition probabilities pj; Patankar (1954) has shown that 
varn;, ~ PE -v) Pj) - NB,.- ЛР), 


(6) 
ШЫТ ASQ v) PU- NBP, 
соу (Nj, ng.) ~ Z(N—») PR -EUY v) Ру je 
e 
of the chain being in state j when ¢ = & and in state k 
of transition probabilities ру; is called T. We shall 


e are already assuming that the only root of modulus 
for example, Bartlett, 1955, p. 26) 


жәнге P9) is the stable probability ‹ 
" en t= (+ vr, Suppose the matrix 
Onsider that the roots of T are distinct. W 
ne is one, Then 7 has a spectral resolution (see, 


Т-Х- ЖАР), (7) 
AF 
Where x is Р, Boe Й 
f Д = 3 
E tity Xj P(N — v) is then 
Аза root of T and P(A) is a matrix pg) of order s x s. The quantity >Р 
©xpressible N-1 N — v) Ари). (8) 
j BQNO 05. X Z ed 
i a . 1 м о 
Since À is less than one (Fréchet, 1938, р. 105) this 15 asymptotic tor large o 


тр Y py(A)Al(1—A)- 

I yN(N - 1) FA - NF, 2 Ра 
"troducing this into (6), we may write "- im 
уагт, ~ N(2P, х (j.j)th element of (I — T + ) 7-1 


Ж 
Cov (n, m, )e М(Р,х() Lyth element of q-r+#) х) P. P. 
go ny) ~ МР, х (j; EP, x (Ый element of (1—7 + Х) -РВ). (10) 
> x (k, 
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= Хуу; thi lacement is permissible since 1n 
We have replaced 2 рудА)/ Ay ty 07 ths тер P 


: : ; er 
this sum A is less than one. If we write D for the diagonal matrix, diag (^. .... P). we тау 
write for the covariance matrix of the т, , 


NDDU - T - X)" - (DU T + X) 3 - DU - X)]. (11) 


— M: 
Тһе matrix (11) is singular since Ул; = №. We suppose the first s— 1 of the т, have a no 
singular distribution which we write as 


1 к 1 (т. бү в) |а em РТИ (12) 
(yxp | 7878) sag n) |... 

А А 7 , ler 

where the row vector n; |N — P, ....n, 4, [N — P, , is written (nj.[N — Pj). Then, unge 


р ` ics cac i ee /8 
the assumption of а common known A for the two chains being compared, it easily wer 
that the likelihood ratio interior to test the hypothesis that the веб P; for the realiza 


i i z 007 At 1ра- 
with oceupation numbers 7.5597 4, is equal to the set Р, for the realization with occupa 


tion numbers m, ,...,m, , , is 


MN ии Cs x) А (13) 
M+N\M N M м 
The similar criterion for testing а single set of occupation probabilities against а hypo" 


thetical set was suggested by P. S. Neal (see Patankar, 1954). The joint frequency function 
of the m; and the n;, may be expressed as 


-MN (т, ml (т) m; 
const. exp cre (af x) 4(% N 


1 nj. (m5. 4 nj. 7 (M5. (14) 
arrn n) nn) a(x) (е). 


M 
The non-singular transformation 


z = n |N- P,- (m; [M — P), | (15) 


wj = N(n;.|N -P) + M(m; [М — P), 
enables the criterion (13) to be expressed purely in terms of 2). The exponent in (14) is m 
fact the sum of two quadr 


placed by an estimate. A is a function of th? 
timate (mj, *ng)l(m; +n). Under the es 
for large N + M distributed normally with ү. 

Dj; and variance proportional to 1/(N + М). Tt follows that the estimate tends in probability 


will often be feasible and cheap (the imp 
provided by an organization with access t 
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Examination of the criterion (13) for a small number s of states throws light on the 
behaviour of the X? test of a contingency table of occupation numbers when the chain is 
one-dependent and not zero-dependent as required for the validity of such a test. We discuss 
the simplest example s — 2. + 

4-1. Two states; s = 2 

T is now (рі a and X is 5 5) 

Dei Pe Р, Р, 
where Р, = Po,1|(P1,2 P21) and P, = Pysl P1. 2 P21): 

1- Т+Х| = p, o po, and if the matrix (11) is called VN 

y = Dina t Peo) | ie ) | 

Piet Par sil 1 
MN (n; |N- mj |MY (P1,2+ Po,1) (16) 
(U +N) PL PG t P2,2) 
is asymptotically distributed as ү? with 1 degree of freedom. | 
In calculating this criterion we use the maximum likelihood estimates 
Pus (Mi, 2+ та) (т, t) fai (тәл +71) (ть. + то,.), 
Dii (m, 4 4-4, (т, 2-7,4.) and р... = (Mo o+ 75,2) (т, + ms, ). 


It follows that 


Since туз ~ ть 1 and э ә ~ Тәу We may substitute 
(my, + 100+) and (ть, +n.) CM +N) 
+m, , and N = m, +n.. We may compare 


for P, and P, respectively, where M = mı, 
Р ы 2 table 


this criterion (16) with the usual criterion for the 2х 


т. Ts, 
|n. i s 
CUTEM (M + №) (ni, ms, — Ma, m, (17) 
15 latter criterion is MN(m,, n, ) (ma, + %2,.) 


ИР 1and P, in (16) are replaced by their maximum likelihood eq 1. 4. 
etween the criteria is the extra factor (P1,2+P2,2)/(P1,1 + pa) in (16). ыр. ^ 29 5. i на 
ne-dependent but the x? criterion for a 2x2 table were used as though there were zero- 
*pendence, the Speni valueof the criterion would not be one but (1,1 + P2,2)/(P1,2+ P2,1)- 


idi „ductions drawn. For instance in the 
us factor TG TH `b the validity of the deduc ‹ 
um m еше Бере educator may define only two states which may, 


education. in $3, the 
al example referred to in $ 5, в ан Е А : 
от example, be атаме апа pupil-initiated activity. An се E e T ae 
[9 Servations smaller than the average time spent in these си wl D y i ue mer 
Py and p, dominate рь and p respectively. Then in using д? one would run the risk of 
nu 255 12 21 
"aiming more significance than is really there. 
4-9. The comparison of т sets of occupation p robabilities. 
Suppose it is desired to test the hypothesis that 7 = 
the same set of occupation probabilities. Then 4 
We can show that the likelihood ratio criterion " ж 
X Ма Aa - (X; МА = XN(u-94(-9. (18) 
124 Aa 1 
1 D 


g realizations are of Markoff chains 
a simple extension of the argument 


Of § 4 
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In this the summations are taken over the r realizations for each of which there is an МЕ 
welch x, of quantities of the type n; /N or m; |M of $ 4-1; is the length of the ith wi 
tion andi z = Мз]. The form of А is given by (11)/N with the maximum likeliho 


i iteri 3) is distri „жй Fe 1} (с-з 1) 
estimate nj, |n;. substituted for ру. The criterion (18) is distributed as y? with (r — 1)( 
degrees of freedom. 


5. EXAMPLE 
The only example on which the methods of this Шыдар 
arising from the researches of Dr Ned A. Flanders of the University of Minnesota. The 


Ў а 224 {еуег- 
data exhibit such extreme differences that a statistical analysis is scarcely needed. Nev 
theless, calculation of the criteria of the previous par: 


54-1 апа also shows the usefulness of the tests describ 
Dr Flanders describes six different states in which 
(1) Teacher accepts ideas, feelings and gives prais 

in the problem solving process. 

(2) Teacher asks question, 
(3) Teacher 


paper have been tried is provided by data 


agraphs reinforces the argument of 
ed in §3 when they are applicable. 

a class may be, as follows: we. 
е so as to integrate student behaviot! 


lectures or gives his own opinions and ideas. 
(4) Teacher gives directions, crit; 
(5) Student talks. 


(6) None of the above; including silence, confusion and administrative-routine state" 
ments made by the teacher. 


icism or justifies the use of his authority. 


йыт 
re observed at short regular interv ci 
h observations suggest that it is арр”. i 
ша Markoff chain so that the methods 


i 5 е 
wh . Tvations in the different states are larg 
enough to inspire confidence in the use of our asymptotic tests 
The data are, for the two classrooms: 
sete. 1 |e lala]. "m 
State ... | | | 3 4 5 6 L | 5 в 
| | 2 3 4 9 
ай Б ый | ыза 
| | | | | | 
1 22 | | 24 2 | | | : 
| | Ж | 2 7 3 6 | 0 
2 | 5 3 1] 8] | g 1 9 ішім! 2 
© | |10 | 25 | sp | ae 2 | 9 [оов | 32 | 5 | 18 
| 4 | des 2 | 714 | s | 14 | 98 0 14 | 32 | 108 40 E 
e 1%] 22 | а | mz | $ | iel шр | 9 | 26 |22 | 1% 
1 | 1 
| | 18 | 134 1 5 | 13 | 53 13 | i 
| | 
The criterion (1) р Ж 


to test the hypothesis that the two sets of transition probabilities are ipe 
3 161-0 which on 30 degrees of freedom ig very significant. This А 
пе equality of occupation probabilities will be difficult to interpret- 


same has the v. 
that the test o 
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shall, nevertheless, calculate the required criterion in order to compare it with the criterion 
calculated on the assuniption that successive observations in the classroom are statistically 


independent, 

The estimate P’ of the row vector of occupation probabilities is the row vector of quan- 
tities (m; +n, )/(I +N). This is (0-047406, 0.07387, 0-27789, 0-17025, 0-22309, 0-20743). 
This estimate satisfies the equation P'Î = P'I to within one figure in the fifth decimal 
place, where Î is equal to 7 with (mj, + nj) (m; -- n; ) for ру. The estimate of the co- 


Variance matrix of occupation numbers for a realization of length М is, as in (11), 


0-06711 0-01035  —0:03182  — 0-031064 0.01934 — 0-03334 
000545 - 0-06514  —0-03412 0.03807  —0-04460 

0-76613  —0:14075  —0-31241  — 0-21599 

0.30988 —0-10109 —0-00227 

0.55204 --0:19644 

0-49266 


The matrix A required by (12) may be found by inverting, for example, the bottom right- 
hand 5 x 5 matrix in the above 6 x 6 matrix. The criterion (13) for testing the equality of the 
Sets of occupation probabilities is then 60-1. The y? criterion for the 2 x 6 contingency table 
of values т j, and a; is 126-0. Thus this example shows the great reduction that can occur 
11 the criterion to pum this hypothesis when independence of successive observations is 


"0t assumed and when pj; dominates pj, j not equal to k. This situation for s - 2 was 
discussed in 54-1. "m 

The assumptions of $3 are known to be approximately true for data of this kind. Hence 
We may obtain further information on the factors affecting the great difference between the 
Sets of Occupation probabilities by considering the tests described in that paragraph. The 
Criterion for testing the equality of the sets of conditional probabilities Vir J not equal to 
k, hag the value 95-0 on 24 degrees of freedom, and the criterion for testing the equality 
( f the Sets р,; has the value 66-0 on 6 degrees of freedom. Thus it appears that differences 
In the occupation probabilities are strongly affected by differences in the probabilities of 
Change from one state to another, and more strongly (per degree of freedom involved) by 


e differences in the length of time spent in à state before a change occurs. 
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OVERFLOW AT A TRAFFIC LIGHT 


FRANK A. HAIGHT 


Institute of Transportation and Traffic Engincering, 
University of California, Los Angeles 


1. LIGHTS IN GENERAL | T 
Suppose vehicles of uniform length arrive at an intersection controlled by a traffic lights benc 
arrival times which constitute a homogeneous Poisson process, with parameter A. The ieu 
of acceleration will be ignored; once а vehicle is near the traffic light it travels with a umi T 
speed S unless stopped. When a queue of vehicles is stopped by the light, headway (dista she 
separation between corresponding parts of adjacent vehicles) is constant, and when 
cars move off the headway will be a larger constant, denoted by K. scott 
The letter will be used to indicate the length of the red phase, so that the t re 
number of arrivals during a red phase will be AP for a fixed cycle light and AZ(/) ane 
variable cycle light. During the green phase, when there is a queue of cars waiting, na се 
discharged with time separation K/S = TT. Once the queue present at the beginning © sip 
green phase has been emptied, the Poisson arrivals continue through the intersection W! 
out delay for the remainder of the green phase. Instead of ch 


by its length, we will usé instead an integer N, w 
in its length: 


aracterizing the green Ju 
hich is the largest multiple of 7’ contain? 
length green = а = NT +07 (0 « 0 « 1). 

Thus, with a sufficiently long queue, at most N vehicles can be discharged during & green 


; " : Е ough 
phase. N is, however, by no means the maximum number of vehicles which may pass thro 5 
the intersection during a green phase, for on 


ce the queue is dissipated, the Poisson stra 
may send cars through with time Separation < Т. In some circumstances № will me 
constant, and in others a stochastic variable governed by vehicle actuation on the side atro? | 
ot refer to input and output to the intersectio" 
n. Thus, when there is n0 queue, a car may ict 
ving both input and output zero. It is essent 8 
to specify under what circumstances input is possible. It might be assumed that 5 1 E 


H В an 
flict with our experience and certainly leads 00 i 

: : А n 
will assume continued input in every circumsta 
except zero queue length. 


2. FIXED CYCLE LIGHTS 
Suppose //, T and N are constan 


ts. We will have occas Р robabilitY: 
which will be abbreviated “ton to use the Poisson p 


which will be abbreviated 


Riu; тр) = A(u, T) ери ри-т ш-т,ғ-1,...), 


ии а. 
= CS 
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where 
А(и,т) = ap 
Я qu r—1l 
This re dt | 
presents tl ili 
eus ee nci ind siame 1 that exactly u members of a queue will 
ме clar е аи beginning with r members апа with 6 si inten - 
p always прив and regular discharge Ino icati cnc 
ays equals AT Гарргохі 59. ur application, duri n in, 
~ са oA [appr oximately A/N], and will be omitted нна 4... 
Вета на. of this paper is to compute th ili Peng ind 
Ci e probability of Z cars being i 
preceding — red phase, when there were X cars in the ieri. " ix igi ifa 
flow must be Ре hase, Z will be called the overflow into the red phase if ne E 
pi " i H X 
zero, and will depend only on the input during the gree ^ spera 
green phase. Lettin 
g 


-— fe; =) = Pr[Z 2 2|X = 2], 
ave 
Whe /@;х) = Р@-х+ХУ;аА) (>N). 
ena < №, = 0. the fi ; т, 
үз йб 0. the formula is also fairly sim ple, and тті і 
orel-Tanner probabilities as гунны Ре наннан 


У 
/(0; 2) = У Rj: <) (= < М). 
den (2) 


< N), consider, for fixed v, the event that the overfl 

" i | t 
after the light turned red in the period that м 
of this is f(z; х) e^, and it may be 


would ec that no vehicles would arrive 
Tegarded as he tuns to clear. The probability 
overflow was х анты that thequeue would first vanish after № +z vehicles and that tl 
is of course pes pepper ба ee of the queue first clearing after N + 2 vehi та 
J= l to сс z; ж), and therefore the difference R(N +2: 2) — f(z; x) ер is н pu ШЫ 
of the probability that the queue would first vanish after а vehicles 

iss es 


апа 

that the overflow is j, that is 
2-1 
Gs 2) Re 
j= 


fro 
N . 
a which we obtain 


2-1 
" f(z; =) = e [R(N +; ®)— TRE ING a) (г2>02< WN). (3) 
0 Ж 
"quatio 
values fts (1), (3) and (3) taken together give a formula for f(z; т) in terms of the precedi 
3; €), P= 1,2,.-,2—1; for various values of 2 and т. In conjunction with inam 
^ ese 


хр 
ressions, it i 
ns i i 
‚ it is necessary to use the following conventions: 


А. Раз A)-9 for u< 0, 
в. R(u;r)-? fo 4<7, 
C. R(u;0)-9 fo и> 0 
сі fo 4-0. 

+N <2, and this is in fact an impossible 


The 
n it wi 
will be noted that f(z; 2) = 0 whenever 2 


Tansiti 
ion. The first two values of (3) are 
f(s =) = e R(N+1; т), 


у; а) = e [RIN +; п) –е RIN +1; =) 


499 Overflow at a traffic light 


r ients 
Jt is also possible to write each f(z; x) explicitly in terms of the Borel- Tanner coefficien 
A(z, =). If we define B(z,x) by means of the (2 — x)th order determinant 


А(2, 2—1) 1 0 - o | 
Alz,z—2) A(z-1,2—2) 1 " 0 | » 
Ва) = (-1y**| 4(52-3) Ай-1,4-3) 46-94-39) 2. 0 | ( 
5 H $ -— 1 
: : : | 
A(z, x) A(z—1,x) A(z—2,2) 2. А(Е+1,9) 


for z > x and B(z,z) = 1, then we can write f(z; x) іп the form 
fle; в) = $ eri pi ай RIN +j; з) 
j=1 
5 
= eon prie Веј AQ +j,") (z>0,{x <N). (5) 


e 
With a table of A(z, ж), the coefficients B(z, x) can be most conveniently calculated from th 
formula "ET (6) 
B(z, x) = -EAC Bij, а). 
=r 


The first few values of A(z, х) and B(z, x) are given below; to avoid awkward fractions, each 
value is multiplied by (z—1)! 


(2-1)! A(z, x) 
RC т | | | к | 
x | 
1 2 3 4 5 6 и 
2 
| — 
1 1 = — — — d pe 
2 1 1 = 
3 3 4 2 t ni cud = 
4 16 24 18 | = = wii 
5 195 200 180 96 24 — T 
1296 2160 2160 1440 ae 
600 120 
7 16807 28812 30870 23520 12600 4320 720 
(2-1)! B(z, x) 
Ж Eee] 
Ne 
( 1 2 3 Т 
4 
` N 5 6 
шы 
1 1 -- = әрі 
2 а 1 = = ара = = 
3 = 2 = = — i 
4 езі 12 -18 6 Ба кей эт 
5 і -32 108 — 96 7. = e) 
T 4 1 m Ji ed ed pe es 720 
| — 7680 9000 а 4320 
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To show that, for fixed x, Xf(z; =) = 1, consider 


o E . © 
ХезҢіз)- X RW +j; x)— EfG: а) (1 —e7?), 
ј=1 ізі 3-1 
Which is equivalent to x (j; x) = X R(N +j; <). 
j=1 det 

Combining this wi i 

g this with (2) gives the value unity provided x < N; in the i 
result is clear from (1). | quoe 
Nn. the red phase, the transition probabilities are simple Poisson expressions. In 
нэ езе and f(z; x), we can write equations involving other probabilities. For matos 
^ zy 18 the probability of a transition from x cars at the beginning of green to y cars at the 

ginning of the following green, we have 
Dey = Ру 2+0: (0+0)А) (=> 0) 

py n Esas Рила) ФМ m 
= Ply; AB) X іа) X Pos AB е) (2 < NY 

іст == 


1 


Also з 
қ letting 7 Pr (n cars waiting at beginning of green |t = о), 


n 
o, = Pr (n cars waiting at beginning of red |t = оо), 
we i — ig P 
have (if these limiting probabilities exist) 
n+N 
=> 39) п; 
on Eyes Jj) т; (8) 


ar n . 
па m, = X,P(n-J: АЙ) с;. (9) 


=й 


I 3. SEMI-ACTUATED LIGHTS 
и section we consider the traffic along a main street into à signalized intersection 
E en. by a light (possibly) actuated by side street traffic. When the light turns green 
is е main street, it must remain green for a minimum fixed period (the main street mini- 
Side ). After the expiration of this time the light will change whenever а car arrives on the 
bits street. Then the light will be red on the main street for а fixed period (side street 
А lal), and a further fixed period (side street extension). If any side street vehicles arrive 
ed the first side street extension the main street light remains red for а further side 
= es extension, measured from the time of arrival of the саг. This process continues as long 
ir arrive on the side street, or until a maximum value (side street maximum) is reached, 
n it turns green again. We will assume Poisson arrivals on both streets, and use the 


9llowing notation: 
A 
д = side street input parameter, 
а = length of side street initial, 
b = length of side street extension, 


N — number of cars able to clear durin 


main street input parameter, 


g main street minimum, 


of main street initial, 
um. 


a = length 
A = length of side street maxim 
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These quantities are fixed constants of the system; In addition, we need to define certam 
stochastic variables which will be used to characterize the etum, m 
В = length of main street red. Defined over a+b < р < A, with density un | ы. м 
— number of cars able to pass through a main street green period. к dn 
№, М+1, М+2,..., with probability distribution q,. For simplicity we w 
the number of ‘slots’ provided by the green phase. Е m 
Once explicit forms are found for g(u) and qn: the problem of determining pi is 
distributions of $2 is solved. For example, an equation like (3) would only nee 
modified to take into account the stochastic nature of the number of slots: 


Меза) = Ў er (nz; а) X Res JG: a. 
n-N і- 


istributi ; to obtain. 
In case the side street maximum is infinite, both of these distributions are ни to pn 
The red phase length distribution is given in a slightly different context by Raff : i 
and was apparently discovered by Garwood (1940). In the notation defined above, i 


һ-1 М —b(1 j \)Ji—1 [ —b(1-- i))]i 
g(u) = ш ae у -е-еу! (нес PEDI а 0) ы 
hb <ux<(h+1)b (h=1,2 ) 


„ду E ns 
The slot distribution can be calculated directly from the Poisson probabilities and tt 
out to be Gn =le (р = ) 


= -eTye-aTe-n (n> М), 
where о = NT. 
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PARTIAL TESTS FOR PARTIAL TAUS* 


Ву LEO А. GOODMAN 
University of Chicago 


l. INTRODUCTION AND SUMMARY 
In his 1942 paper, Kendall introduced and discussed the partial rank correlation coefficient 
бап (7). Since then, the difficulties involved in developing tests of significance for partial т 
have been discussed by Kendall (1948) and the sampling distribution of partial т has been 
Studied by Hoeffding (1948) and Moran (1951). This work has been briefly summarized by 
Kendall (1955) as follows: ‘No tests of significance are yet known for partial ....Hoeffding 
(1948) gives a rather complicated expression for the variance of partial 7 in the limiting 
Case when n [the sample size] is large....Moran (1951) has considered partial 7 without 
reaching any clear conclusions other than that the distributional problem is a very complex 
опе? In the present paper, tests of significance will be presented for a number of different 
Partial correlation coefficients that are closely related to partial т. These tests will be valid 
When the sample size is large and when certain additional assumptions are made about the 
Situations under consideration (see § 2). 
The partial correlation coefficients introduced herein are computed from ranked data, 
asis partial 7, and they measure various aspects of what is usually meant by partial associa- 


tion, For the situations considered in the present paper, some justification can be found 
artial correlation coefficients introduced herein 


less justification can be found for the use of 
in a certain general sense, related to these 
r the use of partial 7 in situations 


о В 
» Т the use of a weighted average of the p 

ап over-all measure of partial association; 
Partia] T, which is also a summary measure, 


Partial correlation coefficients. For some justification fo : я 
that differ somewhat from those considered herein, the reader is referred to Kendall (1942), 


and Hoeft ing (1948 В 
Тһе xd к ль coefficients discussed herein are related to, but differ from, the 
Seneralized partial correlation coefficients introduced by Somers (1959). The methods 
“Veloped in the present paper could, no doubt, be generalized in v E obtain е = 

Significance for various kinds of partial correlation coefficients. For the situa‘ ions considere 
lar partial correlation coefficients discussed 


In the rticu 
present paper, the use of the particu br bruno 
rein hag ж Ноева these coefficients can be given simple probabilistic and 


ары бери] interpretations, and the tests of significance developed for these coefficients are 

Quite easy to apply. 

ENT OF PARTIAL ASSOCIATION 

Parti f whether an association or correlation of 
ial association usually refers to the study oF W° ' | 

quality, Q wi ы y Пу due to the associations of each with a third quality P. 

T with quality R is really he matter by the consideration 


à ide t 
1801іев of partial association usually attempt to an inated (seo, for example, Kendall 
Subpopulations in which the variation of P is è mi , 


2. 'THE МЕАЗОВЕМ 


ados University of Chicago, under sponsorship 
arc ion in whole or in part is permitted for 


* 
of a Research carried out at the Statistical Rese pine a 


any Statistics Branch, Office of Naval Researe 
Purpose of the United States Government. — 
27 
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ЕЕ” fe i tain sense, 
tial association between Q and В, when the variation of P is, w a rea in Tasa 
vies oe be studied herein in situations where the statistical analysis is to 
iminated, w Е 
е he rank orders of the observed values of О, of В, and of P. seal чат ай ӨН 
Е > shall follow the usual practice of arranging the ranks of the observe E а a 
ел { ies present among ; 
е 1,2,3,...,n, and of assuming that no ties аге Į : 7 their 
their natural order 1,2,3,..., ‚а » ith observation (in 
observed values (see Kendall, 1955). Let P; denote the rank of the ith 


. y е 7e shall assume 
natural order) with regard to quality P, so that P,-ifori-1,2,...,n. We sha 
herein that the conditional joint distribution 


Pr(Q <q, R < r| P) = Fiq, r| P) 


: ng 
of the variables Q and R, when Р, is given, is continuous so that the problem of erre 
the observed values of Q and В can also be ignored. Thus, we assume that Q TO БАЛ 
continuous random variables. Only the rank orders of the actual observations О ‘ он 
(and not the actual observations themselves) will be used in the subsequent 5 p em 
analysis. The symbols Q; and В; will denote the ranks of the actual observations 
R®, respectively, associated with a given Р.. 


TF Q? and RO are statistically independent of each other when Р, is given, then 


Pr{Q® <q, RÒ < r| P} = F(q, r | P) 


: itional 
is the product of the conditional distribution G(q | Р) of Q given D, and the condi 
distribution H(r | P) of R given P; i.e. 


Tq т| В) = Gq|P)H(|P) (4 " 
"e ar 
In this case, by adapting the usual meaning of partial association we find that the enit 
association between Q and R is zero when the variation of P is eliminated in a mpra " 
but the expected ‘association between the agreements of [rankings] Q with P and 
R with Р? will in general not be zero. То see 


agree 
that the expected association between the a£ 
ments of rankings Q and R with P will in general not 


= 1,9; 7). 


be equal to zero even when 
Fiq, "| В) = G(q| P)H(r|P) (i21 
consider the situation where the followin, 

(I) the n pairs (09, рә 


12, азу), 


g two conditions are true: 


) are statisticall 


y independent Observations 
distribution F(q, r | Р) of each pair may depend on B) 


(11) Pr(Q > Q9 | P, В} = by Pr(RO > ір, Р} е; 
(Condition (II) attaches Somewhat more signifi 
usually done. This condition will be satisfied iis 
G(q| Р), where P, = 1,2 i i 


упал 
(the condition? 


1, the location parameter is case 
(r | P); this will often be Шы ауз 
membered that linearity also 1 (see 
ual theories of partial correlation for quantitative variables 
for example, Cramér, 1946). 


та ions (D) 
For а description of Some situations where conditio? 
and (IT) can be assumed to hold true, the reader 18 


feld; 
referred to Goodman & Gru? 


Ж ара " Р), 
1959.) Assuming conditions ( that, when Fiq, 7| P) = Gal P) HO | 
true: ы 


the following condition holds 
(Ш) В09> 09, RO > ROP pry с 


i-jCi-j- 
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Considering the 4n(n—1) pairs (0, Ri Р), (0 В» P), where i >), the expected pro- 

Portion of these pairs that are such that 0; > 0; and В, > В, (given that Р, > Pj) is 

а= X b, jc; j2/n(n — 1), while the expected proportion of these pairs that are such that 
і>) 


9; > Q,isb = X b; 2[/n(n — 1), and the expeeted proportion that А 
і>) 
= 5 ег 2] n(n – 1). If the expected association between the agreements of rankings Q with 
2 


P and those of R with P were zero, then the expected proportion of these pairs that are 
Such that Q, > Qj and Р, > R; would be be. Since a is in general not equal to be, the expected 
association between the agreements of rankings Q with P and those of R with P will differ 
from zero even when the partial association (in the usual sense) between Q and В, given Р,, 
18 zero. Since Tor, р measures the intensity of association between the agreements of Q with 
P and those of R with P (see Kendall, 1955), justification of its use as a measure of partial 
Association between Q and R, given Р, is doubtful for situations of the kind considered 
herein, Tt is only in the special case where X bilei- ;- =9 that the expected association 
between agre i ue iation is zero. Thus, т. does not 
greements is zero when the partial associa > Тов.Р 
measure the agreement between Q and R independently of the influence of Р; except perhaps 
In situations where the ‘covariance’ between bi-j and с; ;18 zero. The study of the partial 
association between Q and R, given P. is usually of interest in situations where the 


Covariance’ between b ij and ¢;_; is not zero. | . қ 
Assuming conditions (1), (II) and (IIT) described in the preceding paragraph, we see that 


for the (n. — 1) pairs (Q;, Ri, P), (Qj. Bi В). where i = j + 1, the expected proportion of these 
Pairs that are such that Q, > Q, and R; > В) (given that F; i D) is a, = бус, while the 
9Xpected proportion of these paire that are such that Q; 7 Q; is by. ane the expected pro- 
Portion of those that are such that R; > 2; is с. Since а: = bye Шы ve eens 

tween the agreements of rankings Q with P and those ша wiih. e 1 ды det P. Seria 
M Observations where i = j+ 1 are compared. will be zero when the par E wn тае 2% 

16 usua] sense) between Q and В, given P,, is zero. Thus, pee Із wh? n i 
Use ofa measure of the association 7; between the agreements of ranking: 


Serv e iving joi istri i f the agreements 
P i iving the joint distribution of t 

R ith 7 ‚а obs red in thi ers t i L ' measur fanas ect of thi par jal 
пете? =. ,asa mea 8:0 p e t 


for t 
he (n — 1) pairs of observations W. 4 re. The measur 
5 vat ME re . The sure of 
aSsociation ннн 0 and В, given P, in the situation considered here 
ас 2 E 8 n — 1) pairs о: 
aSsociation T, based on the fourfold table for the (п jj meet tion of partial 
6 ould be computed in an analogous faction’ "s peri dh i > j (see Kendall 1948); it could 
ой a serva sions W. j s 
: оа table for the 1n(n.— 1) pairs of ob tion similar to that given to such measures 
given a probabilistic and operation® but somewhat different from, the 


9f associato d als & Kruskal (1959)). 
n for fourfold tables an А > dal (1958), Goodman 
11 (1955). Kruskal ( ta measure of an aspect ofthe 


Чала interpretation for т (see Kenda oft 
У an approach similar to that preset "b a measure of the association 
Partial association between Q and R, given P 5 09 s R with P, as observed in the 
7% between the agreements of rankings Q with 6: тееп! (в 2018 ii шее 
Surfold table giving the joint distribution of pepe ae distribution of the 
Tons where 7 — 7+. In general, the fourfold ded = j+ k (where Lisa fixed constant), 
*8reements for the (n — #) pairs of observations Where o ments that will measure 
AN be used to obtain a measure of the association Tr р. The symbol k can denote any 
N aspect of the partial association between Q ал! 


fobservations where? = j+ 1 
7 based on the 


agree 


iven 
dR.8 ij 
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between 1 and n — 1, but when Ё is close to n— 1 the frequencies appearing " 
2. а "asi be too — For each k, a different fourfold table will be boim 1 
2. measure of association ту. will also be obtained. Each of these measures 
^ iv ilistic and operational interpretation. | 
E дес aie det by Kendall (1948) which is used to define partial т, 15 ken 
table obtained when the corresponding entries in the separate fourfold tables, compute 
for each fixed К, are added together for / = 1,2, 
distribution of the agreements for the 1n(n — 1) 


не 

k > 0, and partial т is a measure of the association between agreements in this table. ed 
the table obtained when the corresponding entries of a number of separate tables are ш E 
together need not indicate independence between the two qualities deseribed by the ud 
even when there is complete independence in each of the separate fourfold tables, par тү a 
шау differ from zero even when the measure of association т, computed for each o | x 
separate tables (for = 1,2, )is zero. Some justification can be found for basing a meast 2 

of partial association between 0 and В, given P, опа weighted average of the measure? 2 
association r, obtained from the separate fourfold tables (k = 1,2, ...), but less айоо 
can be found, in the situation considered here, for the use of partial т, a measure of ager 
obtained from a fourfold table, which is a weighted average, in a certain sense. of th 
Separate tables (exceptin the special case where the‘ covariance’ betw 

For the situa 


tion considered herein, the difference between the ran 
ful. This meaningfulness leads in a n 


қ ioint 

eon—1; i.e. this table gives the ү қ 
è + A О: 

pairs of observations where i = j +/ 


is zero). 
cen b; , and c;..; 1870! | 
D > is meaning 

ks P, and P ist 


s fold 
atural way to the introduction of the separate fourfo 
tables and the corresponding measures of association т. Althor 


igh no simple tests » 

significance are yet known for 7, it will be possible to develop such tests for the 7; in 4 " 
present situation (see 5 3). For situations where the difference between the ranks 7 > and 1 in 
not meaningful, some justification сап be given for the use of partial 7 (see Kendall (1947 
and Hoeffding (1948)); less justification can be found for the use of T, in these cases. h 

For some further discussion of the relation between ordinal measures of association, 806 


қ 22.42 ch 
ав т or partial т, and measures of association computed for cross-classification tables, 8" 
as the fourfold tables considered here, the read 


у ставка 

er is referred to Kendall (1948), Krusk ЖЕ 
(1958), Goodman & Kruskal (1959). A discussion of measures of partial association CO! 
puted for cross-classification t 


ables will be found in Goodman & Kruskal (1954). 


3. TESTS оғ SIGNIFICANCE 
Let us first consider the fourfold table giving the joint distribution of the agreements of 
rankings Q with P and those of R with P, for the 

i=j+1. 


ге 
(8-1) pairs (Q,, Р, P), (0,8, 8j), vibe" 


› У ф= 1 йк do ues ағы 
ате not statistically independent However, assuming Du n pee rn dien in 
the preceding Section, the $(®— 1) pairs where j = Jt+landj — 13 = — 2 (in the case 
where n is odd) are statistically independent 4 fs i 
where i = j+ Та; 


airs 
і given. the P. and so are the 100—1) P^. 

nd j = 2,4,6,....п—1. Thus, i r ae i 
also assumed, then the usual lar: 


tables can be applied in order to 


Tuy = (à, — 61,84) Кт 0,01 —5,,) (1 —64)) 
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is normal with zero mean and unit variance, where @,, is the proportion of the 4(n—1) =, 
pairs (2 =J+1,7 = 1, 3.5. 2) that are such that ©; > Q; and R; > R;, b, is the pro- 
portion of these ту pairs that are such that Q, > Q;, б is the proportion of these n, pairs 
that are such that В, > R; (see, for example, Pearson & Hartley, 1954, p. 71). Similarly, 


the asymptotic distribution of 
Tis yn; -(бһ- бызбы) [бэ ё ( 1 — b12) (1— 6,4) 


is normal with zero mean and unit variance, where йу is the proportion of the т pairs 
(= l, j = 2.4,....n—1) that are such that О, > Q; and R, > R;, б» is the proportion 
of these 74 pairs that are such that Q; > Q;, ĉi is the proportion of these n, pairs that are 
Such that R; > В,. Thus, a test of the null hypothesis that condition (ПТ) is true can be 
based on either т; y/n, or on тү» үү. Since these two statistics are correlated, they cannot 

9 combined in the most direct fashion to form a single test of significance. However, it 
can be seen that the covariance between (à, бабы) Vm and (@,>— 615615) Yn, is asymp- 


totically ‹ 
Otically 21, 91, where fed, d= Реда > > ба 


2 : m 
g=- and e = Pr{Rin > В, > Riab 


d i i c Gr $ 5 

i Suming that d, and e, are constant and do not depend on i (see Goodman & Grunfeld, 1959). 
“Us result can be generalized to situations where d, and e, are not constant, but the general- 
lation will not be presented here (see Hoeffding & Robbins, 1948, for some results related 


to this реро r is re concerning the asymptotic covariance 2/01, we 
generalization). Using this result g 


find that tho distribution of 
z = (â 5,6) J((n — 0/(9,&0 — 81) (1-4) + 35.0 


18 Normal with zero mean and unit variance, where @ = (ду + 41») is the proportion of the 
ы. 1) pairs (i =ġj+1,j = 1,2, 3,....№-— 1) that are such that ©; > Q; and R; > R; 


5 б, = 16,66), & = +): Л= 4-5 $»-4-8 

% is the Proportion of the (n — 2) triplets {Qir @ Qi) that are such that 
Qia > Qi > Qi» 

"Ча ig the proportion of the (n — 2) triplets (Ғы. R; В, 1} that are such that 

Rya > В, > Ria 

and also for some applications of this 


(see с ы 
ee & Grunfeld, 1959, for a detailed sion ts of rankings Q with P 


result) х А zeen the agreemen 
- Since the measure of association т; between ag) ie gu e 
ала those of R with [ym observed in the fourfold table giving the joint distribution of the 


а Я е = 24-І із simply 

Steements for the (n — 1) pairs of observations where i = j + 1 is simply 

n = @, 5,6) (60.75) 150) 
hi rmal with 
ae that the asymptotic distribution ofz n /{(®— 1)/h,) is normal with zero mean 
Я uni TS pape 53 " 

T “variance, where р L урой 60-09 0 601. 

hus Ls III) js true can be based on Ті Or, more 

s а test of th ll hypothesis that condition ( 5 | Е i 2) 
precisely on i iho For the situation considered here, this test of Significance 
s DW s hi vile "Ca е4 1 vse + of the hypothesis that the partial association between 
1 provides a partial te: ; А 


an В А 
> &iven Р), is zero. 
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Let usnow consider the fourfold table giving p i oint RA BRI ^ v^ [oomen ы pel 
гі — 2) pairs (Q;, Ri Pi) (Q; Bj F5). 

pee pei M. ан беле sire Mon cannot be applied in this case. 2. 
те aces 3 y enm (I) and (IT), the 1(n — 2) = n, pairs (i = j + 2,j = 1,2,5,6,9, 10, 13, pu 
5... here т, is even) are statistically independent, given the P, and so are n 
Б; ricus ie в = hid j = 3,4, 1, 8, 11, 12,...). Thus, if condition (III) is also mcr 
cam 7 sample uns for tests of independence for fourfold tables can be applie 
order to prove that the asymptotic distribution of 


Tai л» = (Ao — бё) A naf0,, 6,1 -ba) (1-6,)) 


m 
is normal with zero mean and unit variance, where â is the proportion of т. бұй, 
(i =j+2,j =1,2,5, 6,9, 10, ...) that are such that 0; > Q;and В; > Ej by is ip m ám 
of these т pairs that are such that (0; > 0; б is the proportion of these n, pairs 
such that R, > R}. Similarly, the asymptotic distribution of 


Тл» = (5, — б.» б.) хтб 001 = б) (1—63)j 


я А ' ogous 
is normal with zero mean and unit variance, where 0, by. and ё, are defined in an pei а 
way for the n, pairs (i = j--2, = 3, 4,7,8, 11, 12, ...). The null hypothesis that apt 
(ІП) is true can be based on either Toy t; Or ON т, үт. It can be seen that the cova 


between (451 —byy 64) л» and (8-0,0) 5 is asymptotically 97,0, where 


№=4,-8, d, = Pr (Qs > Qi» 0:5}, 
9а = 65—08 and е, = Pr Gu» R; > Raj, 


assuming that 4, and e, are not functions of 2. 


É rhat 
(This result can be generalized somew 
but this generali 


zation will not be presented here.) Also, the distribution of 


£a, (à, — 8,6.) Jt — 2)/,6,(1 ЕЯ) (1—6.) + 2,9 
: is the 
can be seen to be normal with zero mean and unit variance, where à, = M05 + Âo) 18 and 
proportion of the n—2 pairs (i =j+2,j=1,2, ...П—2) that are such that Q; > 0) 
R; > R; а 
6, = 10, +649), ĉa = 1636, fy = d,— bs, 0 = &—@, 
d,is the proportion of the (n.— 4) triplets (Qus. 0, 0; ә) that are such that Qus > Qi d oos 
and é, is the proportion of the (1-4) triplets that are such that R, 42 > В; > Rips Smo? f 
measure of association т, between the agreements of rankings Q with P and those 9 
with P, as observed in the fourfold table giving the joint distribution of the agreemen 
for the (n — 2) pairs of observations where j — j+2is simply 
Тұ- (0-5,6 %,25,(1 —5,) (1 ~@,)}, 
H n . H d 
we see that the asymptotic distribution of Ze = Tay/{(n—2)/h,} is normal with zero me? 
and unit variance, where 


hy = 1 [27.96.61 -5)0-&. 
Л л 
Thus, a test of the null hypothesis that condition (ПТ) is true can be based on 72 De 
Ts A ((n — 2)/5,), Which provides a partial test of Significance of the partial associat! 
between Q and R, given E. 
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Let us now consider the general case where the fourfold table giving the joint distribution 
of the agreements for the (n — k) pairs of observations (Q;, R;, P), (Q; В, P;), wherei = j+ k, 
1s of interest, where + is a fixed constant. The special cases where k = 1 and where k = 2 
were discussed above. The measure of association т, between the agreements of rankings 
Q with P and those of R with P, as observed in this fourfold table for the (n — 0) pairs of 


Observati is si , a ^ 
Servations, is simply т, = (8, —5,.0;) JB, 6,0.—5,) 01—6,. 


Where 4, is the proportion of the (n — k) pairs that are such that О; > 0; and R; > R; 
bp is the proportion of these (n — k) pairs that are such that ©; > Q;, and ё, is the proportion 
of the (n — k) pairs that are such that А, > R;. We shall assume that the following condition 
holds true: 

(Па) d, = Pr(Q;,, > Qi > Q; 4] and e; = Pr {Risp > Ri > В; ,) are not functions of i 
(this condition can be weakened in order to obtain somewhat more general results (see 
related results by Hoeffding & Robbins, 1948)). 

By an argument similar to that presented earlier herein, it can be seen that, under con- 
ditions (I), (II), (Па) and (III), the asymptotic distribution (n + co) of T}4/{(n — К)/й,,} is 
normal with zero mean and unit variance, when 

hy = 14 [2,016,001 6,0) 01-20 Л.= dy, — 83, 9. = êr —G, 


d, isthe proportion of the (n — 2h) triplets (Ө; Qis 9; +} that are such that ©, > Q; > Qir 
and é, is the proportion of the (n — 2k) triplets (В, Ri, E; ,) that are such that 


В > Ё > Е 


The quantity f, is an estimate of the ‘serial covariance’ of order Ё in the sequence of random 

Variables ROA 1—01), Slan @) S(Q,.. — Q3). «++» and д, is an estimate of the ‘serial 
e , +2 , 

Covariance’? of order k in the sequence S(Ry41 — Ry), S(Rr+2— В), Sis Вз), ез Where 

(X) = l when X > 0 and S(X) = 0 when X < 0. A test of the null hypothesis that con- 


dition (ry I) is true сап be based on ть (i.e. on ть (n — 0/0), which provides a partial 


tost of Signific artial association between Q and R, given Р. 
The fioe v all Ai e been applied, in the special case where k= 1, by Goodman 
Grunfeld (1959). The reader is referred to this article for a more detailed discussion of the 
Special case, and also for some numerical examples illustrating the p Pe of the Ее 
ased on 7, to the analysis of comovements in time series. The tests based on т, (for 


= I, 2, ...) described herein could also be applied to the analysis of co-movements in time 


Series ; "a ү i a 
1168 in а manner similar to the application of 7, 
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MOMENTS OF ORDER STATISTICS FROM 
A NORMAL POPULATION* 


Ву В. C. BOSE} лхо SHANTI 5. GUPTA} 


University of North Carolina 


1. INTRODUCTION AND SUMMARY 


pes Med on ordered observations have been called systematic statistics by Mosteller 
Bechhofer a = now being increasingly used in new statistical procedures (Bahadur, 1950; 
1949: Gu : мак Bechhofer, Dunnett & Sobel, 1954; Beohbofer s Sobel, 1956; Godwin, 
Dips, : NN 1956; Mosteller, 1948; Nair, 1948; Paulson, 1949, 1952; Seal, 1955). The present 
а sam ray S with the problem of obtaining the moments of Хи». the kth order statistic for 
among eil size n from a normal population Х(0. 1). This problem has been considered 
ojo de hers by Cole (1951), Godwin (1949), Hastings. Mosteller, Tukey & Winsor (1947), 
925) 931), Jones (1948), Paulson (1952), Ruben (1954), Teichroew (1956) and Tippett 
go been shown that x(n. ©), the tth moment of X са 
8 of order (—2i (i = 1,2, ..., M or 44-1) and the integral 


an be expressed in terms of lower 


рае e M07 da, (1-1) 
Wher -o 
ere P, (а) for > 0, is defined by 
E а қ” . 
Prax) = Ё Canes *], (1:3) 


Paris understood that in (1-2), Ф is replaced after differentiation by dz), the cumulative 
if ution function (c.d.f.) of N(0, 1). Pœ) is thus a polynomial of degree (n—=t) in Ф(ғ) 
м ^ and is zero if t > п. Exact values of all odd order moments can be байна when 
( in › and the exact values of all even order moments can be dedved янда n < 6. Godwin 
) and Jones (1948) have given tables of exact moments jin, k) for t = 1 and 2. The 


Corp, | ; ( 1 | 
ропа g tables fort = 8 and 4 &re provided in this paper. In general the numerical 
an be expeditiously done by using the Gauss method of 


based on the zeros and the weight factors of the Hermite- 
provided by Salzer, Zucker & Capuano (1952). 
wever, be retained in the computation, 
1 the final moments about the mean. 


а R 
"өү of the integral (1:1) е 

о Mosq. Co (Szego, 1939) 
иен for which tables have been 
to Secure y large number of decimal place 
an adequate number of significan 


s must, ho 
t figures ir 


Let 25. 2. THE FUNCTION 1 (т, К, =) 
ea be n indepe 7: om ] po pulation 0, 1) with zero 
i rvati у а погта P N 
ы n "e я E т р ndent observations fror Т 
| i ә. 
| Е s va) < 20 S. < Yo) (2 1) 
2 eS 


ir Force through the Air Force Office of 


Command. Reproduction in whole or in 
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ive distribution function 
nked observations among 21, zs, .... Xp. Then the cumulative distribution 
be the n ranke: жй ' ; т 
f Xp, the random variable corresponding to жу, (1 < k < п), is given by 
o (k)> 


Рип, k, x) = Prob (Xo; < а) 


4 * 2 2.2) 
= aot ФЕ! —Фш) кезед, | 
БІНЕН 
1 е) 2 (9:3) 
where Ф(т) is defined as P(x) = Omi fe dx, 
n! (2:4) 
and C is the constant T es е 


(k— 1)! (n — k)U 


P қ .) for con- 
Let us now define the function P(n, Е, з), which we shall abbreviate to P(x) fo 
venience, by the relation 


„dP, (25) 
Paila) = (2m) de, 
where P (x) is given by (2-2). Then 6) 
2. 
Py) = Clb (x) [1 — (a), 


Р ; > і>т 
Tt is clear that Руа) is a polynomial of degree n —t in D(z) if 1 < t < n, and is zero for 
In fact, we can write 


д е (27) 
Р,.1(2) = Caio - Фуі-41, 


; ;, Ф) 
where Ф is replaced by (x) after the differentiation. Tt follows that for given 1, 7, Ае: 
is а bounded function of x. Тһе function 


6 elow 
в Р/а), Рх), P(x) and P,(x) are given b 
explicitly, where Ф is written for Ф(ш). 


Руа) = ОФе-щ1 — yeu 1) 


-(n-1)9], - 
Plat) = CO1 — yc туь эу 73(k —1) (n —2) P + (n — 1) (n — 2) Ф]. po 
Pie) = COA — O)r-k-3(( 1) (9) (5—3) -3(:— 1) (£72) (n —3) p 

+31) (0—32) (n—3) 92 (»— 1) (n—2) (n.— 3) Фа], (2-10) 


Б/ш) = ОФе51 — ф)и-ь-а 6-1) (2—2) (6-3) (k—4)—4(5—1) (2) (k - 3) (n — 2 ы 
0670213) (8—4)92—40. ауа мы ш фз 


2.11) 
*(n-1)(n—2) 3) (n—4) qs], u^ 
3. А SYSTEM ОҒ DIFFERENTIAL EQUATIONS SATISFIED BY Руа) 1) 
From (25) Pie) = maet! И 
d 


ad 
Руа) = (2i ede 22 (2m ae] 


= (22)e* | №, ар 
d. 


d (3-2) 
жа тр |. 
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In general let us assume iar 
d-tP, 
P(x) = одне" Sy, rale) = dy? 


Where ды(и) = 0, (3:4) 


(3:3) 


and g, () is a polynomialin z of the rth degree. Differentiating (3-3) and using (2-5), we have 
ti drap. , LO dB, 
Б, (а) = (2л)%@+® ten x [ао em + (tg, (=) +09, (а) r^ А (3-5) 


This leads to the recurrence relation 


d 

#м+а(®) = gei азға; Joa. ie), (3-6) 
Where 9.,(x) should be interpreted as zero. This together with (3-4) determines all the 
Polynomials 9,,(%). Starting from k= (8-7) 

We can successively calculate 
Jolt) = 1, gy (x) = 2. (3-8) 
#о,з(ж) = 1, 91,3(=) = 33, — 95,3(2) = 22? 4 1, (3-9) 
9а(%) = 1, gy (е) = 6a, Joal) = 11a? +4, 92 = 629 +72, (3-10) 


9,5(%) = 1, 915(2) = 10%, gə sX) = 352410, 93,5(=) = 5048 + 45x, 
94,5(%) = 2421 +46? +7. (3-11) 
Hence we have the set of equations 


ару” е? P (x), (3:12) 
ат di 
n) e Tnt * P(x), (3-13) 
= й d dab (Bala) 
en P, en жыры (3-15) 
CNN pa) 179 4 (24t + 4602+ T) 3 "ел ior е Pia), 


(8-16) 
We can proceed in this manner up to any order, but it should be noted that P. (x) is a 
“onstant, and P(x) = 0 ift > n. The general equation is 
езін Руд). 


ар, 
90, шта dii 2 d nt a) Eau) т T ~ (2л ar (3-17) 


436 Moments of order statistics from a normal population 


4. MOMENTS OF Xip 


We shall first prove the following Lemma: 
Lemma. If « and ғ are non-negative integers, then 


+= drap , s 
| aya @ = (—1)/'а(@—1)...(@—г+1) Har or 0 (#1) 


-o 


accordingasr < «orr > о, where jr, ,isthe (x —r)th order moment of Хо) about the origin. 
It should be noted that by definition 


Гаа =: (2) 

From (3-12) and (3-13) - _ L „фар, (4:3) 
dz (Эл)! 

d?P, x Š 1 š E 

Te == rji е? P(x) + ae P(x), (4-4) 


and in general using the system of equations (3:12)-(3-17) we can write 


ФР, т Bs 5 
= ha) t Р ue 
Tet = E Moyen Pa), ( 
where h,(x) is a polynomial in x. Now 
+0 drap, „Роне ЕС uP 6 
ГЕ % dari dag Б d -% 7 Г” aa оф. " 
Since P(x) for any non-negative integer t is a bounded function of z, it follows from sie 
the first part on the right-hand side of (4-6) vanishes, Repeating this process We ge 
Ппт”<оа 
+o „аР, += 
n > Tan & —(-1ya(a—1)...(a —r4- nf zh 
If r > а, we get, on repeating the process o times, ты a pers 
+o drip, + 
% = (—1\« “дч 
w a — dae = (-1)*a(a—1)...3.2. if" Ftd 
-ар |+® 
| = (-1*a(g—1 о 1 Сам = 0. 
This proves the Lemma. ISl dare |- 
On applying the Lemma and integrati 
t , 
grating the equations (3:13)-(3:16) we get 
ч 1 [+o я 
=z ж Py) е- ах, pl 
—34(9v41;-—1 [*9 8) 
Omt D = ary Т қеледі, * 
— 22! + (61! 4- Tu! =. 1 += 4:9) 
Hy + (биз + Tpi) (87) М Ра) e-22* da. б 
70- 1505 —45+ (24и, + 4j 4 пу ы. (410) 


(27i J_ P5) et da. 
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We ВИ ИЕ | ) 

m may write (п, k) instead of x, to denote the fact that we have the «th moment about 

| e origin of the kth order statistic out of a sample of n observations from N (0, 1). We then 
ave 


gu(n. k) = x | | P,(x) "ах, (4-11) 
, 1 ей —$ 22 Jy ә 
(п, k) = 1+ ол и Бүт)е-і dx, (4-12) 
1 +o ый 
yaln. К) = Sui, k) + zem. Р(а)е- ах, (4-13) 
, бя 1 TOM Ay 
jin.) = фи E) тусу [= Po) ec? da. (4-14) 


Fs general applying the Lemma to (3:17) we can express y(n, k) in terms of lower moments 
eve 2 : А 
еп (odd) order when / is even (odd) and the integral 


(4:15) 


+o А 
| P, (i) emi da, 
Ј- 


where | 
here the polynomials P,(x) ... P(x) are given by (2:8)-(2-1 1). In the particular case when 


w= hi he Vn 13 the largest of ті. ds... Cne P(x) assumes the very simple form 
Pla) = n(n—1) (n7)... (n -t 1) [D(2)]". (4-16) 
NU sen 
Чопо ^ 2 “2 Am- о da: 417 
€ we get (т.т) = = NI e? da, ( ) 
EMIL. 
(6) 30-і (4-18) 
2 =1+ F [0(2)]^ e7** ax, 16 
(т.т) 1+ (mi Ен 
n\ tt? 
4 n—4 e- da, 4-19 
изт, m) = $us n) ШЕТТЕ тн [0(2)]"7* 7*7 da. (4-19) 
И, 
MU (x)? e-t de (4-20) 
aln, 2) = ит”) + gr m (x) 


ar Should be noted that in the formulae (4:17) to (4:20) [0(2)]* should be interpreted as 
9 ift; n 
Е і . + 
Bs Integrals of the type occurring in this p: 
€ (1931). Equations (4-11), ..., (4-14), togetl 
г correspond to equations (19), ..., (22) on рр. 


foll 

owi pp | 
Wing correspondences between our and Hojo's paper | 
п Е т = no. above Xu» 


o's equation (19) and so on.* 


aper have been numerically evaluated by 
her with equations (9:8),..., (9-11) of our 
334-5 of Hojo's paper. If we make the 


оц 6—1 > p = no. below Хо» 
Te А | 
чаша (4-11), together with (9-8), reduces to Hoj j | 

С .'Hoio оп рр. 325-6 will provide values of 

nd E tabled by ш. — 13 inclusive and the 3rd and 4th 
eee m only calculated to 8 decimal places 
A bon the effective number of 


* 
[Т 
the EE appears that the functions T, I, S & 
es in 2nd moments of all order statistics © 
"et h 8 up to п = 10 inclusive. His functions wer 


Ы 1 + the mean, 
en the raw moments are converted to moments abou 
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4 
5. EXACT VALUES OF SOME MOMENTS 
. +o i (5:1) 
І.(а) -| [Ф(а2)]" e dx, 
= : һы (5:2) 
= ті 
then - Ца) = 7. | ен 
N | [G (az) — т e ах = 0, 
ом ВИ 5 
since the integrand is an odd function of х. Hence 
кн. лун (“+ ) Ти) pü 
Lmsla) m or = . еб 
In particular L(a) = 1а) = іт”, (5-6) 
п Һа)-,а)-321,а)- у/а)- Та) — а). 


5 ., (а). 
In general, J,,,.,(a) сап be expressed as a linear function of I,, (a), am—2(4), yest? 
Differentiating (5-1) with respect to a (this is justified in virtue of the uniform 


E integrands)» 
of the integrals with respect to a, —co < а < oo, and the continuity of the integ 
we get, for n — 2, а, 1 a 


ence 


(5:7) 
da ~ m (+2) 
a E (a? 4- 2) (a? 4- 1) (5% 
80 that (а) = =ратеќап (Via? + 1]). 
3 d (5:9) 
Using (5-6) (а) = 4810 tan (J[a? + 1]) – 22%. 
Since Р, 


9): 
.3) and (5 
н1(2) is a polynomial in Ф(>) of degree n — t — 1, by using (5:2), (5-5), (5:8) a! 


га 
“а, k) for 
we can exactly evaluate (4-15) if n <t+4. Hence we can exactly evaluate /4(7» 


48) 
Е s (19 
odd values of t, if » < 5 and all even values of t if n < 6. Godwin (1949) and Jone 

have given tables of exact moments n 


and 4 are given below. 


м 


—_\“ 


f pu 
for t = 1 and 2. The corresponding tables fo 


Table 1. p14(n, k) 


2 
k=n 


2 2A = 1-41047 39589 — 
3 3A — 2-11571 09383 
2B +20 = 9.70042 57044 124-68 60 = 0-36156 66401 | 


| —54+5B+45C = 3-22487 93638 


0 
204 —10B — 10C = 0-60261 10668 


5 
Here: А =— = 0-170523 69794 35, B=— | =0 9 34 
An 2am! = 0-06349 3635 > 

15 
c= ‘gyi te tan (/2) = 1.28671 99169 55. 1 

4 , 
m | : 54. 
significant figures is very much reduced. As М. <. Kendall has pointed out (Biometrika, 19 facto" 
| г his S-functions as printed appear to be in error by the omission © 
of 27 which seems to have been included correctly in his moment calculations.—Ed.) 


аи CTC 
——— ] 
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Table 2. (т, k) 


| 
| 


k=n | Е=п-1 Е-тп-2 
— |К ж 
3-0 = 
3 Р 
3+a 3-24 = 
| = 4.19454 59401 = 0-61090 81198 
4 
| | _ 3--2а | 3—2а — 
= 5-38909 18802 | = 0-61090 81198 
5 | 3+6 | 
| ›+с | 3-1 10a — 4b — 4c 3-20 
| в а + 6b - 6 
| | = 6.52339 54801 | — 0.95187 74565 | = 024945 pie 
| 6 | A | | 
| 3— 5a 3b - 3c | 3425a-9b-9¢ | 3— 20a 4- 6b + 6c 
| | = 7-59745 67578 = 1-15308 91273 = 0.24945 41149 
65 
Вет = 119454 59400 81, b= gri? tan (y$) = 346675 52225 38, 


уй = 0:05664 02635 46. 


== 


4m 


The a 
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к svens 
i o 7 к k, 1956). Steven 
Р : tervals (Еп4еу, 1949; Blank, ud 
biased system of confidence in a À І а 
the 5 a practical procedure for calculating such a emn e adi и epum 
iari i Although his procedure is short (of the orde 1918 
Е ial or Poisson parameters. Althoug p ites геі 
bes pecus the present writers), the almost universal occurrence of the i anta 
А ria : niv | 4. 
i im to warrant the publication of a table giving an optimum interval directly 
see 


2. А PROPERTY OF POISSON PROBABILITY SUMS 


Ж 1 ех rectation 
The Poisson distribution is the distribution of a random variable c which has the ex] 
m and the probability eimi 
pm) = 


(1) 
у @ = 0,1,2,...:3 am > 0) 
UH 


4 : 1) has 
of taking on the value i. We note the well-known fact that for fixed m the sequence ( 


2 А + attained by two 
a unique largest number except that when m is an integer the race us is ra he! mr 
consecutive members. On each side of the largest member(s) p,(m) is a strictly 
function of i. 

On the other hand, a sum S pm) (e <) 
i-e 
с aximun 
over a fixed set (c, ej -- 1, ..., сз) is an analytic function of m and has a unique тах 
for m satisfying Ж (2) 
а pum) = ?а-((т)-р,(т) = 0 (e, > 0), d 
=, * 

that is, for т = Mees = [сү(сү-+ 1)... с.) а-а), 
(Equation (3) applies for Сіз 


nn 
0, as does (2) if p. (m) is defined to be identically 
Since p, 1(т) = pe 


(m) according as m En ey (сі > 0), it follows that 


4) 
6:-1 €i ( 
У эт) = У Р (т) according as т E meo 
іше-і ісе с that: 
We note that т, е increases as either one of its subscripts increases, From (4) we se 
for m between Mey1,e-1 and m, 29 
с:—1 Cy Crkl 
У p,(m) > Ур(т) > У pim) > .... 
zc i-c-1 i-o і-сі-1 
Also for this interval 
6-1 с.-2 с:—3 
„Ў Рт) > X рт) > € рту»... 
i-e-l i=c,-2 t=¢,-3 
Hence for fixed r 
err er ) А (5) 
a. PE = = рт) (ns, ces ст $ шна ынчы? с, жа 0, 1, адын 9, » . ! 
and these maximum sums form a non-increasing function of m, as illustrated in Fig. 1- | 
3. CALCULATION OF THE TABLE " 
Тһе calculation of Table 1 printed оп PP. 448-53 below is of the same general nature kr m 
described in detail by Crow (1956) for the binomial parameter. Тһе main body of sapiat all 
was performed on a high-speed electronic computer which calculated in the pre table 
required terms of the Poisson distribution from (1), without reference to existing a sk b 
The confidence limits for ¢ — 0 (1) 20 were checked by independent ealeulation using | 


inn cdi 
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calculator and the tables of Poisson individual terms and sums of Molina (1942) and Pearson 
& Hartley (1954), and Kitagawa's (1952) table of individual terms. Both calculations were 
made to three decimal places of the parameter m, but accuracy of the last place was not 
always sought in the desk calculation. 


"ГГ пои 


mo, , mi, ced M, г+2 03, г+3 T4 г+4 
m 


Fig. 1. Poisson probability sums illustrated with r — 3. The characteristies are general except 
that for r = 0, c = 0 the slope is negative at m = 0. 


" The calculation of a system, say ô = б(с; т, є), of confidence intervals for m with con- 
4 2 Я В H 
dence Coefficient 1—¢ from observations on the Poisson variable c reduces to the deter- 
mination, for each m, of acceptance regions А(т) consisting of consecutive integers 
à 8 


6 x 
LE, т, +++, Cy Such that Pa 6 
X pm) > 1-е is 


et 

“ng Satisfying alternative further conditions sufficient to define the system uniquely. The 
Central Confidence in: А > 
Me le 1, defineq by the two properties stated in $ 1, will be denoted by 8, (онбир КЕ 8 

ation for the analogous binomial intervals). The system à; is defined by Sterne's (1954) 
condition that e 
an imme iate consequence of this condition is that the corresponding acceptance regions 
dalm) are as short as possible, in other words, that 6, has property (i) stated in$ 1. The system 


4 Әтове as am 
Contin: 
tinuity of the sum (6) as a function of m for fixed c, and cs. 
k le accepta; ж 
om s aking a change in c, or с, or both 
in ( Small to larger values of m, starting at m — 0, and making g 1 2 
(с ili zh 
НТ" 2... у. ty sum of less than »'4- 1 
tma ® = itr’), then by definition of the A,(m) no probability 


Ins 
lu 
not q 


e қ к 2 E 3 4 
“eptance regions.) We therefore maintain the same size if possible by substituting cf +1 


for с! cá+1 р 
" ains 1 —€ for A Я 
р ра Dus 64 0.4 for Аз(т), or where >. р (т) first attains 1 —e for A,(m). If the 
Tobay 4. NEN і-сі ч , Қ 
ability Sum drops to 1—¢ before substitution is possible, we enlarge A;(m) by adding 


28-2 
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i i i tance 
cat 1 to (ci, --- cs). Alternatively, it would be possible to shift or enlarge the accep 


P Я Е idence 
region to the left, adding су — 1, but to no useful purpose; in fact it would yield confide 
sets that are pairs of intervals rather than single intervals.* 


4. COMPARISON OF SYSTEMS OF CONFIDENCE INTERVALS 


For the purpose of comparison the à, 
obtained from Pearson & Hartley's t 
algebraic and percentage decreases 
coefficients of 90 and 99 9. 


" sstem 
system was also computed for c < 50 and the д, SY a 
ables 40 and 8. Some lengths d, of 6, and correspon 


А : о ур idence 
in length from d, are shown in Table 2 for confic 
The results are rounded. 


Table 2. Comparison of dı, аз and d, 

П | 
d, d-d, | Wadda | пода 

в E | | 
90 99 90 99 | ə | gg 90 | ыы 

| | | 
| | | 0-1% 
0 3-00 530 | 056 | 0-53 187% | 9.90, 0:19 .3 
1 4-69 7-43 0-27 0-52 57 | 70 -33 | ] 0 
2 5-94 917 GN | 060 | ma | gs o0 | 97 
3 6-94 10-64 0-53 060 | 16 | 57 09 | 2 
4 7.79 11-92 0-94 037 | 120 | 31 ONES йы 
5 8-54 13-07 1-26 0-56 МЛ | as | gau | O9 
6 9.23 14:19 0-32 0-63 3-5 4-5 24 | 07 
1 9-86 15-10 092 | 0.63 9.3 41 | 42 | 01 
8 10-45 16-01 1-43 | 055 13-7 за | ав | 97 
9 11-01 16-87 0-24 091 | 29 54 8 | 2v 
10 11-54 17-68 1-59 1-13 | 13.2 6:4 43 | 29 
12 12-52 19-20 1-63 921 | 130 | i | 41 | 09 
14 13-49 20-01 | 099 1.28 7-4 62 | 33 pi 
16 14-26 291 | 915 | 981 | тү ыы 1-9 ре 
18 15-06 2915 | 034 | 0% 1-6 0-2 за | œ 
20 15-81 2432 | рэт 0-61 | - | 24 
25 17.54 , | %-- ү ту ту 
27-00 1-71 0-42 9-7 1 3.2 : 
30 19-10 29-44 0-03 1:30 " : g | 15 
40 21-87 Е "f E qe d 
ШІ ae 3377 | (0.90 125 | а 37 | r8 0 1 
37-61 | 0.08 ШЕ 0-3 4-6 | 1-4 " 


ррг eciable or at least mall valu 
sea i Sr n 
SC increases, The re 


Je 
lativeim ; reci 
Provement of д r ô, is also app е 
for small values of c but occasionally becomes negative, H 4 Over б; & and % Һау, 
acceptance regions of the same Size, the average of (d 9. However, since 3 ny fin t 
о - 7 = to a 
value exceeds the corresponding average of 4—4 Via d,)/d, DVvóro = 0.1, 
3 
* Тһе confidence Sets for the 1958) 
z parameter of i T ks x i ( i 

occasionally consist of pairs of inter als, rw e distribution considered by Clunies-Ro®* 0,00 
Hence his Statement on P. 278:*In practice espe foie 


n e 
: confidence coefficients 0-90 aPC ү 
it seems that th ieti i uil 
0, 0-9000*, needs е ion- 26 restriction (4.6 a, b) is not ne 

Orrection ; see Crow (1956), p. 425. 
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Tt is of interest to compare the tabulated д, limits with various large-sample approxi- 
mations in order to test the validity of using the 6, limits as far as they are tabulated 
апа an approximation beyond that. Such a comparison is summarized below for 
confidence coefficient 95 % and c = 50: 


Table 3. Comparison with large-sample approximations 


| % Diff. of 
H 2 | 
Identification of interval | Eq.no. in Upper | ИЕ | Tings 3 | —— 
| Hald (1952) | limit | limit interval | from а, 
EY, _ к жәме - | 
8, = | | 37-11 28-51 0-0 
5 | ра | 37-67 28-18 —2.2 
9, = 37-67 27.28 — 5:3 
Normal üpprox. wi 
prox. without | = | E о р 
continuity correction | = 65-91 d | көтек me 
9rmal approx. with | | = | о 
, Continuity correction (22.5.4) 66:45 Ра | авия ы 
Pquare-root normal approx. | 
With continuity correction (22.5.5) | 60:99 d pap Ee 
Mean of (22.5.4) and (22.5.5) — 65-93 87-08 28-85 +0-1 


Quite Benerally the continuity correction lengthens the interval by very close to 1. Тһе 
mean of the two approximations with continuity corrections is closer to бу than either one 
Ugly for both upper and lower limits in the above instance, and this seems to be the case 
9*6 ббала even substantially smaller c (cf. Hald, p. 718). Consequently the suggested 
“8e-sample approximation, effectively identical with the mean of (22.5.4) and (22 5.5), 
u Simpler by virtue of averaging the two approximations to percentage points for a given 
ald’s (22.2. 5115 


7)) before solving for the limits, is 


-1 


Wher setae Е 
е is 1 f Н stribution of mean 0 an 
Ug 15 the upper 1006/2 percentage point of the nor mal dist a and 


Vari Н атаа ня 
ine i “Ті rt wer for the lower limit m, . 
5 508 L, and the upper signs are for the upper limit my. the lo b. 
Mation = 
7 
Mr а; 
al 


(7) ds remarkably accurate approximation to ô; the abolute error in Hirom 
: Pparently is less than 0-1 for c > 0 when є = 0-10 and for c > 2 when e = 0-05. Ап 
сы native approach is that of Walsh (1954), who considered, rather than the correction 
dem ° (7), functions of c which he then determined so as to render 2. "e 
900 аа Tange. Comparisons similar to the above for — Roh egeo = s "i 
in hens about the same order of differences. Since the б, s [e a? sale oe ý мес: 1 
Sin all Observed cases and the percentage difference in length tends to zero as c 
es infinite, it is concluded that using б, intervals as far "s they are tabulated and the 

> sample approximation (7) for larger c is a valid and satisfactory procedure. | 
to th. "comparison with д, the randomized 90 % confidence interval aes calculated according 
е Procedure of Stevens (1957) for the first few values of ¢ and intervals of 0-1 in the added 


ee 
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iformly distributed variable.* The mean lengths, say d;. of these intervals for each с 
= compared in Table 4 with the corresponding values of dy: 


Table 4. Comparison with mean randomized intervals 


e d; d, 100(4-4,)/4, 
0 9-04 2-44 — 16-4 % 
1 3-90 4-43 — 12.0 
2 5-11 5-44 - 641 
3 6-08 6-41 = 51 
4 6-91 6-85 + 0-9 
5 7-66 7:28 + 52 
6 8-33 8-91 3685 
Т 8-96 8-94 + 0-2 
8 9-54 9-02 + 58 
9 10-10 10-77 - 6:3 
10 10-62 10-01 + 61 


The system б, varies irregularly in length and location while 4; varies smoothly- Hoy 


i i . dé ndi 
ever, in any particular case 9; is determined in part by a random number a between 0 an 


H 1 Ы 
and hence will take on Some irregularity; as an extreme example, the 90% interval fo 
€ = 0 and x < 0-05 has length 0. he 
Crow (1956) discussed briefly, in relation to the binomial parameter, the use of i 

expected length of a confidence interval аз a criterion for choice of interval. In a simi 
14 


Expected length of interval 


8 
i 10 12 14 


4 те, ОВ confidence intervals for 90 96 confidence. 
YP; бү: tabulated herein; б: randomized central. 

* A slight change from his pro 

the quadrati 


бү: central; 43: Sterne 


vens’ 
0, 1, 2, 3, the exact formula ву 


б 1 is not e 
| small c, so that it is desirable ima та кена ышыне E 
imit is US 
apolates, but one except: ate (from x = 1). The true limi 
than the backward с: Айы келе exception was found. The forward extrapolate is more acc 
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ES Ы: Repeated lengths of бү, дз, д, and the shortest unbiased system, say д», were 
кз с pee values of m. the last one by Stevens's procedure, using Molina's table: 
EL Е engths are graphed in Fig. 2. The randomized system д, has the smallest 
wih Pe a = iin the range of caleulation, but the difference from that of д, is not more than 
ае н : m < 14. The expected length of à; lies between those of б, and д, except that 
Bios рн st hat of à, slightly for 0-01 < m < 1-5. By using à, instead of д; the experimenter 

xpect’ a 90% confidence interval shorter by between 0-45 (for т = 1) and about 


0-9 (for 7 X m < 14). 
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448 Table 1. Confidence limits for the expectation of a Poisson variable 
Confidence coefficient, 100(1— є) % 
ЕЕЕ 
e 
80 90 95 99 99:9 
| S 
L | 5 wor | Upper 
ower Upper Lower Upper Lower | Upper Lower Upper Lower 
= = — € |= d 
065 
0| 0 1.819 | 0 2436] 0 3-285 | 0 ати | 0 T : 
1 0-223 | 3-546 4-532 | 5.323 0-010 6-914 0-001 bec 
2| 0824| 4758 5-976 $086 | 0149 | 8727] 0045) со 
3| 1535, 5882 7-512 81021 0426 1 1003) i80 | ш 
4 1-819 | 7-564 8:597 9-598 0-823 12-347 0-429 17.114 
5| 2645 8599 | 9716 | 1970 11-177 | 1279 13-793 | 0-739 sat 
6 3546 | 902 | 11342] 2613| 12-817] 1785| 1зэтт| 1:107 10.987 
si 10008 12531 | 3-285 13-765 | 2.330 10-801 1:920 | 51.603 
58 | 12481 “BE 285 | 14-921 | 2906 18362 1971 | © eg 
9 5-696 | 13-243 + го | 29:17 
10 | 5:888 | 15.205 | 16-768 3:507 19-462 2452 | 04.017 
i “h 97208 17-633 | 4-130 | 90-676 2-961 d 
“5 " Р ы | ға 26: - 
b я | n . 19-050 | 4771, 22042| 3491| 27 
13 | 8.520 18537 j. 20:335 | 4771| 23-765 | 4042| 290 
32| баш | 16 6-686 | 21-364 5829 24925 4611 20.37 
| 18938 8:102 | 22-945 | 6-668 | 95.992 | 5:195 
15 9-922 90.414 92 31.999 
i5 | asl зв 8102 23-762 | 6914 27-718 5794 3.622 
17 | 1948] | а 9-598 | 25-400 28:852 | 6:405 HE 
18 | 12481| 23.744 9598 | 26-306 29-000 | 74098! 35.916 
19 | 13243 25.707 15177 | 27-735 31-839 7-065 | 57.384 
20| 15205 | ов 11-177 | 28-966 | 10-009 | 32-547 8.023 | 50-108 
ui Ru! apr: 12817 | 30017| 10473 34-183 | 8-840 p 124 
| 26. в А 1? 
22 | 15438 | 28.469 12817 | 31-675 | 35-204 | 9:560 41.245 
23 16-914 29-983 13-765 | 32-277 3.544 9-561 43-041 
24 | 18537 | 30-159 14-921 | 34-048 :819 10:710 441102 
25 | 18537 | 31.507 14-021 34605| 13.793 | 38.939 | 11:532 45019 
26 | 1894 | 33.03 16768 | 36.030 | 13.793 | ‚373 | 11:532 7.08 
27 | 2041 | 34-49 1671 | 3767 | 1528 | 4130 | 1273 4301 
28 22:04 34-585 ue | 38-165 15-98 | 2-85 13:60 49.32 
29 2204 | 35.92 Hd | 39-76 1680 | 43-9] 13-60 50:97 
30 | 22.33 87-39 20 2. | 40-94 16-80 :36 14:89 51-16 
31 | 2374 | 39.07 "33° | 41.75 | 18.36 | { 15-14 pe" 
32| 25-71 3907 2126 | 4345 | 1836 1611 | 954g 
33 2571 4023 21:36 | 44.26 | 1046 : ivi | 953 
34 2571 41-69 22-045 | 45.98 20-285 ian 1-11 59 
2 à 23-76 Een 2 49-96 17 57-1 
35 26:97 43-25 = 59 47-095 20-68 ая 13:49 4 
45-00 93.76 47-69 5> 51-78 E 582 
36 2847 44-20 26935 ^ 46-38 25.48 9 22-04 52-28 18:49 50:87” 
. 3.025 е 25- 8- и а 599 
27 | д 1449 | 96035] 4707 | 963] d 2204 ^ 5493 | 1987 | 50.63 
39 | 3015 | 4579 | 2800 | 49-19 2631 | prag | 2379 | 5474 | 1999 | (об 
40| 3151 | 4890 | 1999 | 4958 | әтлә 290: | 5014 | 2127 | 63-18 
| Е E 8-99 29-99 50-96 98-97 24-925 57-615 21:60 65-18 
m ao д 3018 | 5254 | 28.97 2583 | 58:35 | 2268 6570 
43 9442 ерт 5 53-46 | 30.09 60-39 | 2317 | (00 
44 3459 | 5264 | sadi | 2409 | 3167 6059 | 2413 | gg19, 
45 3592 | 5439 | ase. | 55445 | 31-678 62-13 | 2408 | 1012 
at Жы 21 33-64 | 5710 | 32.28 63-035 | 25-63 | 20.66 
4 Е 55-1 5. 5 ase % й! 
47 39.07 8548 god um | eum ss А 
48 3907, 5661 | 3700 5048 | 3466) ped 
49 3907 5795 | 37-04 | gia, oe бөзі 
50 4099 5944 8. е 36-0: 7-02 
3944 | 62-69 | 37.67 69:83 


Table 1 (cont.) 


Confidence coefficient, 100(1— є) % 


ы 20 95 99 99.9 
Lower Use | а Жу | | 
| ррег Lower Upper Lower | Upper Lower | Upper Lower Upper 
| 2 | Е Е м 
41-69 | | 
| 4905 10-105 62-86 37-67 66-76 71:56 
44-50 40-99 j 38-165 66:76 73-20 
| 4459 | 40-99 3976 — 6810 73-62 
| 4448 41-74 40-04 69-62 | 75-16 
di 45-22 40-94 71-09 | 76-61 
| ria | 44-87 69-49 | 4175 7128 | 3894 | 77-15 
48-99 44-87 345 72-66 78.71 
| pp 45:00 74.22 80-06 
| 46-38 75-49 80-65 
47-97 75-785 82.21 
| E 77-16 | 83-56 
| $ 78:73 | 84-12 
j E 79:98 85:65 
55 ! 80-25 87-12 
95:16 жа 
- i: TUB 81-61 87-55 
8533 BOF 83:14 89-05 
566] | 2740 84-57 90-72 
ap | S% 84-07 90-96 
86-01 92-42 45-18 100-36 
87-48 94-345 45-21 101-02 
89-23 47-08 102.34 
89-23 47-08 — 104-01 
90-37 97-42 48-01 104-50 
91-78 98-36 49-32 10613 
9-1: 93-48 99-09 49:32 107-235 
90-44 58:84 94-23 5454 100-61 50-97 108-17 
91:79 60-24 94-705 56-14 102-165 51:39 110-17 
93.29 | 61.90 9606 | 5761 | 102-42 | 5176 11034 
94-30 62-81 97-545 | 57615 | 103-84 53-54 | 111-85 
65-645 94-80 62-81 99-17 58:35 | 105-66 113-51 
6745 95-78 9917 | 60:39 | 106-12 
67-45 97-10 100-32 60-39 | 107-10 
67-45 98-51 60.76 | 101-71 60-59 108-615 116-84 
68-49 | 100-28 66-76 10331? 110-16 117.54 
69-36 100-81 60-76 104-40 110-37 119-21 
42 — 101.095 | 6810 10458 111-78 
-67 102-38 105-905 113:45 
7 103-71 107-32 | 11433 
6 105-17 109-11 66-815 | 114-99 
:07 | 106-92 109-61 66-815 | 116-44 
i 7-92 | 118-33 
47 | 106-92 110-11 67-92 
-155 | 107-63 111-44 69-83 11833 
4 108-915 112-87 69:83 ee 
4 110-26 114.84 70-05 121-00 
-27 111-75 114-84 : 122.69 130-26 
113-35 115-605 67-62 131-72 
113-35 116-93 67-62 133-64 
114-11 118:35 68-19 133-75 
З: 115-40 120-36 70.125 135.20 
83-37 116:74 120:36 70-125 1937-15 


Table 1 (cont.) 


Confidence coefficient, 100(1 — €) 95 


226 
99:9 
5 99 
| 80 90 95 ЕЗІ" 
| р Е 
с er ^ Upper 
zs J Lower | 
Lower | Upper | Lower | Upper | Lower | Upper | Lower Upper | - 
| €— | 
| E 4 в 121-06 
101 8788 | 113-21 | 83-73 11821 81-61 6 
102 | 8788 | 114475 | 85-04 | 119-88 | 83414 | 122-37 
103 | 8864 |11579 | 8643 |11988 | 8457 | 123-77 à 
104 | 89-89 | 117.19 | 8805 | 120-55 | 84-57 125-46 96 
105 | 9119 |11895 | 89-04 | 121-82 | 84-67 | 126-26 ү» 
106 | 92-55 | 119-50 89-04 | 123-15 86-01 126-48 53 
107 | 94115 | 119-50 | 8915 | 124-57 | 87-48 | 127-78 эз | 14551 
108 | 9534 | 120-44 | 9044 | 126-52 | 89-23 129-14 3:56 | 136-3 тала | 14713 
109 9534 | 121-67 | 91-79 | 126-52 | 89-23 | 130-68 | 83:56 13 pen тым | меді 
110 | 9534 | 122-925 | 93.29 | 126-945 | 89-23 139.03 | 8412 1209-94 i з | 14892 
| —— f 5 79-53 s 
111 | 96-08 | 124233 | 9480 | 128-20 | 90:37 | 132.03 | 8565 | 140-54 ied 150-46 
112 97.32 125-61 94-80 129-50 91-78 | 133-145 87.12 | od 80-83 151-91 
113 98-60 127.22 94-80 130-87 93-48 | 134-48 87-12 | 142512 81-845 15231 
114 | 9995 | 128-23 | 95-78 | 132-44 | 9423 | 135-92 87.55 | 143-64 xem 153-19 
115 | 10143 | 128.23 | 97-10 | 133-66 | 9423 | 137-79 | 89-05 | 145-13 : 155-518 
" 3.5 " 
116 | 103-02 | 128-84 | 98-51 | 133-66 | 94705 137-79 | 90.79 | 145-19 83 is 195-09 
117 | 103-02 | 130-06 | 100-28 | 134-54 | 96-06 | 13849 | 90-72 14654 A Te 157-12 
118 | 103-02 | 131-30 | 100-81 | 135-81 97-545 | 139.79 90-96 | 148-01 86 à 158:89 
119 | 10345 | 132-575 | 100-81 | 137-13 | 9937 | 141-16 | 9242 | 149-76 dms 159-49 
120 | 10468 | 133-01 | 101-095 | 138-55 | 9917 | 14270 | 94345 149-76 86-4 160-45 
121 | 105-94 | 135-37 | 102.38 | 14054 | 9947 | 144-01 94-345 | 150-93 Hb 162:0 
122 | 10724 | 137.08 | 103-71 |14054 | 100-32 | 144.01 9435 | 152355 | 892 103:36 
123 | 108-63 | 137-08 | 10517 | 140-85 | 101-71 | 145-08 95:76 | 154-18 89-23 16319 
124 | 110-275 | 137-18 | 106-92 | 142-09 | 103-315 | 146-39 97-42 | 154-60 90-34 65:29 
1 11 
125 | 111-18 | 13838 | 10692 | 143-37 | 104.40 | 147.80 98-36 | 155.31 920 167.11 
126 | 111-18 | 139-605 | 106-92 | 144-71 | 104-40 149-53 | 98:36 | 156-69 92-01 167.12 
127 | 111-18 | 140-85 | 107-63 | 146-16 | 10458 | 150319 | 99-09 | 158-25 92:59 | 168.52 
128 | 111-98 | 142-14 | 108-915 | 147-94 | 105-905 | 150-36 | 100-61 | 159-53 9436 | 170.14 
129 | 113-21 |14349 | 110-26 | 147-94 | 107-32 151-63 | 102-165 | 159-67 95:01° n2 
130 | 114-475 | 145-01 | 111-75 | 14835 | 109.11 | 152-96 | 102-165 | 161-01 95:01" 1118! 
қ 5 
ІЗІ | 115-79 | 14645 | 113-35 | 149-60 | 109.61 | 154.39 | 102.42 | 162-46 9040 | 159.28 
132 | 117-19 | 14645 | 113-35 | 150-88 | 109.61 156-32 | 103-84 | 164-31 97-95 | 115.08 
133 | 11895 | 146-66 | 113-35 | 15291 | 11011 156-32 | 105-66 | 164-31 97-95 | 7511 
134 | 11950 | 147-86 | 114-11 | 153-67 | 111.44 156-87 | 106-12 | 165-33 9859 | 116.51 
135 | 11950 | 14908 | 11540 | 155-43 | 119.97 | 18815 | 106-12 | 166-71 | 100-36 1781 
E Í xe | 
136 | 11950 | 150-33 | 116-74 | 155-43 114-84 | 15948 | 107-10 168.29 | 101:02 11927, 
137 12044 15161 | 118-21 | 155.815 114-84 | 160-925 | 108-615 | 169.49 | 101:02 179-10 
138 | 121-67 | 15296 | 119-88 | 157-05 | 11484 | 162-79 | 110-16 | 169-64 | 10234 |112! 
139 | 122-925 | 154-45 | 119-88 | 158-33 | 115.608 162-79 | 110-16 | 170-98 | 104-01 1831 
140 12423 15601 | 119-88 | 159-65 11693 | 163.35 | 110.37 | 172.41 | 104-01 183-19 
141 12561 15601 | 120-55 | 161-075 11835 | 164 50 H 
| 63 1 i 104 84 
142 19722 156-09 | 121.82 | 163.09 120-36 | 165.96 ported md 100-13 1552 
143 12823 | 157-28 |19915 | 163-02 | 12036 | 16739 | 114-33 | 17525 | 10723 181-45 
144 12823 | 15849 | 194.57 | 163.24 12036 | 169-33 | 11433 |17661 | 10723% 1915 
145 12823 | 159-73 | 126-52 | 164-47 121.06 | 169.33 114-99 | 17811 | 108-17 18940) 
146 12884 100-995 | 126.52 | 165-73 199-37 | 1 | aT 0 
| 2.2 69-80 3 Ы 110 9 
147 13006 | 16231 | 126-52 | 167-03 123-77 | 171.07 gehe | ke 11017 101-61 
148 | 13180 | 169-725 | 126-948 | 168-41 | 19546 | 172-388 | 11833 | 180-84 | 11034 | 022 
149 | 132-575 | 165-68 | 128-20 | 170.00 12626 | 173-79 | 118-33 | 182.29 | 111-85 1957 
150 133-91 16577 | 129-50 | 171.09 126-26 5-485 13:51 
| | 175-485 | 119.59 | 183.81 | 1 
If c is the observed freq " 


confidence limi 


uency or count and mr, 
ts for its expectation, m, 


„ү 0) 
my аге the lower and upper 100(1 —6) % 
then Pr (mj < т <my)> 1—6. 


Table 1 (cont.) 451 


Confidence coefficient, 100(1— c) 95 


80 90 95 99 99-9 | 
| 


Lower | Upper Lower | Upper Lower , Upper Lower | Upper Lower | Upper 


Е --- = 


135-37 | 


130-37 | 171.09 | 12648 | 176-23 | 121-09 | 184-975 | 11551 19567 | 
137-08 | 166-648 | 132-44 | 171-84 | 127-78 | 176-23 | 122-69 | 185-08 | 113-95 | 195-67 | 
137-08 167-84 | 133-66 | 173-08 | 129-14 17748 | 122-69 | 18640 | 11552 | 196-84 | 
137-08 | 169.06 | 133-66 | 174-36 | 130-68 178-77 | 122-78 | 187-81 | 116-84 198-33 | 
137-18 | 170.305 | 133-66 | 175-69 | 132-03 | 180-14 | 124-16 | 189-50 | 116-84 | 200-04 
138-38 | 17158 | 134-54 | 177-13 | 132-03 181-67 | 125-70 | 190-28 | 117-54 | 200-05 | 
139-605 | 172.92 | 135-81 | 178-96 | 132-03 | 183-08 | 127-07 | 190-615 | 119-21 | 201-42 
140-85 |17439 | 137-13 | 17896 | 133-145 | 183-05 | 127-07 | 191-94 | 12017 | 202-94 
142.14 | 176.06 | 13855 | 179-18 | 134-48 18386 | 127.31 | 193-36 | 12017 | 20443 | 
143.49 | 176.06 | 140-54 180-405 | 135-92 185-13 | 12870 19519 | 12112 | 20459 
145-01 | 176-06 | 140-54 | 181055 | 13779 156-46 275 | 195-59 205-98 
14645 | 177-14. | 140-54 | 182-94 7:70 | 187-89 -50 | 196-13 207-55 
14645 | 178.34 | 140-85 | 18430 | 137-79 | 189-83 50 | 197-46 | 123-46 | 208-83 
146-45 | 179.56 | 142-09 | 185-80 | 138-49 | 189-83 | 131-82 | 198-88 | 124-08 | 209-13 


146-66 | 180.80 | 14337 | 18730 | 139-79 | 190-21 | 133-21 | 200-84 | 126-07 | 210-52 
147-86 | 5 лә | 200-94 | 126-67 | 212-12 

182-08 | 144-71 18730 | 141-16 | 19146 | 13479 | 2 26-67 212 
149-08 | 193.42 | 14616. | 187-70 | 142-70 | 192-76 | 135-99 | 201-62 | 126-75 | 213-26 
ia 18489 | 147.94 | 188.93 | 144-01 | 194-115 | 135-99 | 202.94 | 128-21 | 213-64 
19L61 | 186-56 | 147-04 | 190-18 | 14401 | 19563 | 136-30 | 20436 | 13014 215-04 
52-96 | 186-568 | 147-94 | 191-47 | 144-01 197-09 | 137-68 | 206-19 | 130-14 | 216-665 
15-45 ^5 5 7 24 | 200-00 | 130-26 

186-565 | 148-35 | 192.83 | 145-08 | 197-09 | 13024 | 2 2 
156-01 | 187-59 | 149.60 | 194-36 | 146-39 | 197-78 | 140554 | 20708 | 131-72 


AA 7 54 | 208-40 | 133-64 | 21 

| 188-78 195.73 | 147.30 | 199-04 | 140-54 |2 

ome 189-99 195.73 | 149-53 | 20035 | 140-76 | 209-81 | 133-64 | 22 
3009 | 191.23 | 19618 |12019 | 201-73 | 14212 | 211-50 | 133-75 | 22: 

РЕ 192-49 | 155.43 | 197-405 | 150-19 | 203-355 | 143-64 13530 | 22 

isan 193-81 | 15543 | 198655 | 150-36 20436 | 145-13 13745 22 

reed 195-22 | 155.43 199-94 | 151-63 | 20436 | 145-18 137-15 22 
50-995 197-12 155.815 | 201-30 152.06 | 205:315 | 145-19 137.22 226. 

162.3 | 55:81 | 3 sth agi 

16 l 197-33 | 157-05 | 202-80 | 15439 | 206-58 146-54 138-66 | 227-09 
чү 5 | 207 8 140-64 | 228-48 

197.3: 58-93 | 204- 56-32 | 207.90 | 148-01 
165-08 | 19733 | 15833 | 20430 | 156-32 | 10,68 | 22848 


197.97 | 159.65 | 204 150.32 | 209.30 | 149-76 
l65. 7 | 159-65 | 204-30 5 3 
reid 199-16 | 161.075 | 204-62 | 156-32 | 211-03 | 149-76 
16577 | 20036 | 163-02 | 205-84 | 156-87 | 211-69 | 14976 |, 
166 : | 201-58 | 163-02 | 207.08 | 15815 | 211-69 | 150-93 | 
167.84 | 209-825 | 163-02 | 208-36 | 159-48 21282 | 152:35° 22 
169-06 204-11 | 163.24 | 209-69 | 160-025 | 214-09 M | 2 
170. 205-45 | 164-47 | 21113 | 16279 | 21540 | 194: Е 
WAP 206-94 | 165.73 | 212965 | 162-79 | 216-81 | 154-60 147-13" | 237-34 
8 | 208-52 167.03 | 212-965 | 162-79 | 218-56 155:31 148-24 | 238-82 
174.39 | 208-52 | 168-41 | 213-08 | 163-35 219-16 ue oed n | 240-56 
176.09 208-92 | 170-00 | 21424 | 164-63 | 21916 | 15855 | Oe 2 


; 209. А 5- 220.29 | 159-53 | 230.03 | 15046 241-75 
176-06 9:49 | 171.09 | 215-47 | 165-96 159.53 | 231.33 | 151-91 | 243-19 


140-74 | 231-34 
142-10 | 231-55 
143-83 | 232-92 
144-55 | 234-435 | 
144-55 | 235-94 
145-515 | 235-99 


116. 210-675 à " 57-39 221-56 5 
| = 5B lade nre fmc 19738 | 292-86 |15967 | 23271 | 151901 245-18 

6-06 : E | 234-28 | 15231 245-18 

177. 213-1 71. | А . 224-26 | 161.01 | 2342 52 245 

Es - 21020 oar A A 10.33 | 225-905 |16246 | 23550 | 153-79 246-15 
| "34 | ер : ecd 3 -31 2935-50 75 | 247-55 

1 215.63 Dices | 592 ij 226-81 164 

79. 174- 222-10 | 169-80 ; 236 ер 

180.26 | 216.985 ге | 29210 | 171-07 | 226-81 | 16431 | 236-68 P^ | 240-18 
| " | 21840 | irria | созбо | 179866 | 22773 | өм |н | 250-20 
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THE DISTRIBUTION OF THE NUMBER OF SUCCESSES 
IN А SEQUENCE OF DEPENDENT TRIALS 


Ву К. В. GABRIEL 


Hebrew University, Jerusalem and Uni versity of North Carolina 


1. INTRODUCTION 


Р : SUME И variables 
Consider a sequence of т trials and an initial trial whose outcomes are the random vari 
X, Xa, ..., X, and Xo respectively, where 


x ( =1 ifthe kth trial is successful 
Ж 
| =0 otherwise, 


for k = 0,1, 2,...,n. Let the probability of a success at the initial trial be 


В = Pr{X, = 1), 


and assume the conditional probabilities, which are denoted by 


21 = Pr{X, = | |X,4 = 1), 
Ро = Pr(X, =l 1... = 0), 
р Е ing the 
a Ж 1,2, ...,n, to be independent of Ё and of the outcomes of the trials preceding t 
c — l)st. 


This sequence is an irreducible Markov Chain with two ergodic states. Its stationary 


probability distribution has ag probability of a success 
Р=— №. 
L= (ш, =Po) 


1- - 
-— d M 
0 
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It is known that S is asymptotically normally distributed with mean ж/н and variance 
26?|n? (Feller, 1957, chapter xirr.6; Smith, 1958). Substituting for и and о?, one obtains 


; 1-а 
E(S)~ пР, Хаг(8) ~ aPü-Pu—y 


Whereq = D — Po. This is an asymptotic result which indicates neither the exact distribution 
for small n nor the rapidity of approach to normality. It seems to have been given first by 
Markoy (1924, § 60). 


2. THE EXACT DISTRIBUTION 
І А ? 
f S successes occur in э trials there will be a number of changes from success on one trial 
1 1 а жыз x : 1 a 
Катац the initial trial) to failure on the next, and vice versa. Denote the number of 
ch қ i 

angos by C and let a and b stand for the least integer not smaller than 1C —1 or 4C 
l'éspectively. 

1 . d H H 
е йм first the case of a success at the initial trial, then S successes with C changes will 
© Olve b changes from success to failure, а changes from failure to success, as well as 5-а 
SUCCEsges қ Е + : та 
| cesses following successes and n—S— failures following failures. The probability of 

n р : i 
У one Arrangement of S successes and n — 5 failures with C changes is therefore 


(1 — ри рарї “(1 — Po)" S$, 


le, 71! z2)! ey 
5 ES жеу; 
РТ — о)" ( —po Mn 


Next 


> ANY arrangement of n trials with S successes and C changes will involve a changes to 


Suce Қ "ү Y мые 
esses which may оссиг before апу а of the S successes, i.e. in any of в different pos- 


Sib] те А 
* positions. Also b changes occur before failures, of which the first must occur before 
d 


the first 6: ъ-5-1\ .. г 
ist failure and the rest may be arranged in any of ( 5-1 | different ways. Given 


the А ; 
uni arrangements of both kinds of changes, the position of all changes in the sequence is 
ашау determined, and therefore the total number of possible arrangements of C changes 


T H : " rug | 
? trials with S successes is I 
а =L 


hanges in n trials following a successful initial trial is 


b a 
: 5) (0-5-1) вр s (152) ey. 
"8. іа em - (2) 5-1 то РӘ" ро Vs 


ырп 
Өх сер. mber of Changes C may be any number from 1 to т+%- |28 ++] = C, say, 
Suc if g= n when О, = 0. Thus the probability of S successes in т trials following a 
Val nitial trial is obtained by summing the above probabilities over all possible 


с, (8% [n—S—1 тау ey 
PriS |n, X, = 1) = а-ә: $ (2)( b-i i ms и * 


"mong 


). Hence the probability of 5 successes 
With с 


cessful] į 
Чев of су 
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In a similar manner for the case of a failure on the initial trial 
_s © (S-1) (n-S) (1-71\* а) 
Pr {S |n, X, = 0} = р? (1 – ро)" > 5-1 py i-m) ud 
e is 
where C, = n-- 3— |28 — 1 —n|, except for S = 0 when C, = 0 and the summation abov 
understood to include only the term C — 0. 


Р З : -obabilit, 
Ав one would expect, when d — 0, i.e. in the case of independent trials, either probab y 
becomes that of the binomial distribution. 


"- | “tional pro” 
Finally, given the probability В of a success at the initial trial, the unconditional 1 
bability of S successes in the sequence of n trials becomes 


Pr {S |n} = R.Pr(S |n, X = 1+ (1 R).Pr(S |n, X = 0). 


im- 
This form of the distribution is rather unw ieldy but does not seem to lend itself to $ 


means 
plification. It may be easier to obtain an impression of the closeness to normality by 
of the moments. 


3. MOMENTS OF THE DISTRIBUTION 


d 18 
. » 5 S ses Si 
For a sequence of n dependent trials the rth factorial moment of the number of successe 


"фм Ж Prk, -ь X 2-1) 


ИЕ, 
4,<1<...<1, 


ла 


where i, i, ...,i, is any subset of r of the integers 1 


,2,... n (Gumbel, 1938, table 1 а 
formula 17). Putting k; =ù; 


=i; (j = 2,...,r) and k = i, this becomes 
rd, = | Xo Pr(X, еі; Kea M e 
where Y denotes summation over all sets 
sum is < n. For a Markov Chain this becomes 


быға. 1), 


‚у whose 
of r positive integer suffices (k4, .- SE) W 


(n XP Pr(Xj, = ОРЦ ы-і Қам “РАХ, ыы 1| Жанан” I} 
Now Pr{X,=1 |Хыұт = Р(1 em di Pr (X 


kia = l| Xp Y 
for = 1,2,...,mandi = ia 


‚® (Feller, 1957, р. 385). In particular, 
Pr DO xi | n VELA = 1} = Р+(1 —P) ач 

Pr(X, =1}= P 4 (0 — Pd, 

Introducing these probabilities into the expression for the factorial moments. and writine 
f = 1/(1—d), one obtains à 


pı = nP * (t — P)df(1 — d"), 


and 


б = PTT 4 PUL R—2Pyaptn( - )-10 -4]-- 0 — Pg. Рама - d") 
alates 
—3n? +2n 
фи = рз" % + P?(24+R—3P) HPO -d-na ау (3—4d) 4 2(1 —d”)] 
+P(L—P)(1+2R—3P)d )df*[nd(1—q 1) —24(1 ан) лата ау] 


+(Е - Py -PPAR а") + $^ 0 ау, prt d) (1 agy 
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y= piti SM NNO pas АРМ -dp - f - d Qd) 


+n(1—d) (11 та 202) — 6(1 —d")]+ 3P?(1 — P) (1 4-.R - 2P) 
x Mf*pi?d(Y — dy? — nd(1 — d) (5 —d) — 2nd"(1 — d) + 6d (1 —d")] 
*PQ- Pj 38—AP)df'pi*( —d) — За – 4") + та — d)? 
~4nd"(1 — 4) (1 —5d)] + (R — P) (1 — P df*[d*(1 — d") – $nd"(1 — d) (2 — 7d + 11d*) 
md" (Y — d)? (1 — 2d) — id"(1 — d]. 
These expressions are unwieldy and their substitution into the formulae for the moments 


about the mean (Kendall, 1948, chapter 3) does not yield simpler forms. In numerical 
Work j қ ; 5 
vork it would appear advisable to compute the 0 values first and then obtain the other 
Moments from them. 
Again, for d = 0 these moments become those of the binomial distribution. 
In the Special case R = P, whatever d, i.e. in a stationary Markov Chain, the first three 
Moments are: ' 
Aj = nP, 
из = P(1 — P) [n(1 +4) f - 24/1 —d")], 
из = P0 — P) (1 — 2P) [nfl + 60/201 + d7)) — 60014) PPA = 7). 
Шу or large, ^, approximations may be obtained by ignoring terms 1n d" and terms not 
Olving ».* Then 


ду & nP, 


4 


1+4 
из я пР(1- Р) E 


1 1-2P 1+4а+4 


у: я Jn JIPU - P) (1 +а) уа -d 


2)? 


1 1-6P(1—P) 1+ 10d +a 
Va ie. P(1—P) 1— 4? 


Wher, 
Standa pis involving (R — P) and n- or powers of n™ are ignored, and у, уа are the usual 
us 

hi zed 3rd and 4th cumulants. : 5. for a sufficienti 
lon в 5 level of approximation the cumulants are independent of R, i.e. to befactoni 4 
into quence the outcome of the initial trial is immaterial. д», y, and Уз can ea : du 
in. Corresponding term for the binomial distribution and a term pei 2... 
the ая Бел, y, and Уз Will vanish whilst д and дә correspond to the mean i ida m 
andy, =. Ptotic normal distribution (§1). For any т, P алй d, the approximations to Y; 

алау give some indication of the deviation from normality. 
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4. NUMERICAL CALCULATIONS AND AN APPLICATION dm 
An idea of the form of the distribution of S is given by the cumulants = тее d 
Tables 1 and 2. The exact computations are cumbersome and have "um : v езін! 
to a few selected values of the parameters. It will be noted that for values о безе е. 
0-4, /ә and уз will be as for Р = 0-8 and 0-6, respectively, y, will change sign, an 
will be minus the mean for P = 0:8 and 0-6. 
” 


Table 1. Cumulants of the distribution of S in sequences of seven trials 


МЕн ызын 
D е 0-6 0-8 
— 
0:8 
gm 0 0-2 0-4 0-8 0 0-2 ve | 
ТІ x P " 56 
т 42 49 49 4.2 5-6 5-6 65, 5.02 
Exaot 1-08 2.37 3-39 1-53 1-12 1-58 ЖАЙ 10-08 
Ёа \Approx. | 1-68 2.52 3-92 15-12 1-12 1-68 WE | ud 
Exact | —0-154 | —0-228 | —0-294 | —0.394 | —0-567 | —0-837 | — 1,081 | ~ 540 
Уз \Approx. | —0-154 | —0-241 | —0.332 —0-691 | —0-567 | —0-887 | —1-219 0.709 
Exact | -0261 | —0-442 | —0-690 | —1425 | 0:036 | 0.380 | 0-686 0:956 
Үз Approx. | —0-261 —0-828 | —1-606 | — 7-001 0-036 0.1113 | 0219 
= 
Жы. 
Table 2. Cumulants of the distribution of S in sequences of thirty trials > 
Р 0-6 0-8 
8 
d 0 0-2 0-4 0-8 0 0-2 0-4 ы 
LL eed 
4 
т 18 18 18 18 24 24 24 : 1 
еш | 7-20 10-65 16.27 55-45 4-80 7:10 10:84 EET 
? | Approx. 7:20 10:80 16-80 64-80 4-80 7:20 11:19 MT. 
DN | = 0075 | —0-115 | —0-157 | —0-297 -0:274 | —0423 | —0-516 | 1927 
* Approx. | —0:076 | —0117 | —0160 | 033. | L 0274 | -0429 | —0-589 | оп 
" ы — 0:061 | —0-109 | —0-181 | . 0.684 0-008 0-105 0:227 0.223 
Approx. | —0-061 | —0393 | 0.375 | _ 1-633 0-008 0-026 0-051 


For d — 0 the distribution is binomial, 
Skewness and kurtosis increase consid: 
as they do those of the binomial, Thi 


, 


jant? 
as was noted. With increasing d, the yana i 
erably. Changes in n and P affect the moments” g 
8 regularity holds only if d? is negligible, e-g- for 
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eing considered as * à 
$ d аз ‘successes’ and ‘failures’, res tively i iliti 
or January = bec pectively). The estimated probabilities 
he exact cumulants are therefore 
в, = 14608, да = 18744, уу=+0040, a= -0:195, - 
While the large s ; я 
ge sample approximate value of the variance is 
Jt; = Р(1-Р) (14d) 4) = 17:240. 


arameter values the normal approximation using the 
о 


Т, 
able 3 shows that for these p 
Observed data are 


larg 

e sam M : В 

ааыа P variance is very close to the discrete exact distribution. 
or only 27 years and are therefore not presented in the table. 


Table 
5 ossi distribution and asymptotic normal distribution, the comparison being made 
he values n = 31, P = 0473, d= 0-381 (suggested by data of January rainfall at 


Tel Aviv) 
Ё— 
No. of Probability Probability Probability 
Successos No. of No. of 
E successes successes} 
TN xact Approx. Exact Approx. Exact Approx. 
ecc» ——— = (| Былыр 
0 
1 0-0000 | 0-0003 
2 0:0001 0:0004 11 0-0671 0-0651 21 0-0306 0-0301 
3 0-0004 0-0009 12 0-0794 0:0782 22 0-0206 0:0202 
4 0:0010 0-0019 13 0-0890 0-0886 23 0-0129 0-0129 
5 0:0027 0-0036 14 0-0942 0-0947 24 0.0075 | 0-0078 
0-0057 0-0065 15 0-0946 0:0956 25 0:0040 | 0:0044 
6 
7 0-0105 | 0-0110 16 оовөө | 00907 | 26 0.0020 | 00023 
8 0-0177 0:0176 17 0-0810 0:0819 27 0-0009 0-0012 
9 0:0275 0:0266 18 0-0692 0:0696 28 0:0003 0-0006 
10 0:0396 0:0380 19 0:0558 0:0557 29 0:0001 0-0002 
0-0531 0-05 0:0426 0:0422 30 0:0000 | 0-0001 
m 50 31 0.0000 | 00001 


Note on computation 
dious. It involves evalua- 


distribution js rather te 
mmation signs of the expressions for 


Рг{5 | Хо = 0}. 


Com е 
tion о а of the exact probability 
he n(n + 1) + 2 terms inside the su 
Thi Pr(S |n, Xo = 1) and 
tes best done with the help of tables of logarithms of binomial coefficients (Hald, 1952, 
XIV). І am happy to ackno wledge Mr k in computing the 


“tribution of Table 3. 


s В. Falk’s precise Wor 


Iwi е : 
Wish to thank Profs. W. J. Hall and W. Hoeffding for their helpful suggestions. 
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Editorial note added in proof 


The reader may care to note that a paper by P. V. Krishna Iyer and N.S. Shakuntala 
on ‘Cumulants of some distributions arising from a two-state Markoff chain’, which has 
appeared since the present paper was sent to press (1959, Proc. Camb. Phil. Soc. 55, 273-8) 


covers some ground in parallel with that covered by Mr Gabriel. в. S. P 
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TABLE OF THE UPPER 10% POINTS OF THE 
STUDENTIZED RANGE 


Bv J. PACHARES 
Hughes Aircraft Company, Culver City, California 


]. INTRODUCTION 


Let g = 

deen sa ie familiar ‘studentized’ range when sampling from a normal population, 

on p ignes: ж з erange of n values and 8 is а sample estimate of standard deviation based 

the "e ei ere and distributed independently of w. If F(q) = Pr (20/8 < q), then 

зы a А or which F(q) = 1 —« have been computed to three significant figures for 
n = 2 (1) 20 and v = 1(1) 20, 24, 30, 40, 60, 120, co. The values of q when a = 0-05 


Were 
also т : 2 : 
ecomputed for checking purposes with the result that several minor disagree- 


ments wi 

WI Pears В r : 

Well аз ios Pearson & Hartley's table 29 (1954) were noted.* The need for 10% points as 

in his Eh extreme points was originally suggested by Prof, Henry Scheffé (1953, p. 91) 
per diseussing methods for judging all contrasts in the analysis of variance. 


2. FORMULAE AND NOTATION 


Let à 
що) = (оп) ев, А0) = [0008,0 Ы 
J0 


ange in samples of n from a 


and 16 
tP 
„l ution function of the r 


Хо, 1 t) be the cumulative distrib 
Population. Using the form given by May (1952), we have that 
(= = [eatin Од OPO q) 
0 
Where 
P(t) = «| z(u) ( (аз!) — E(u))" du. (2) 
n alte к -о 
“nate form for P, (t) derived by Hartley (1942) is given by 
It P(t) = (2E(30))" + 2n L z(u) (E(u) -Е(и —py du. (3) 
n à и 
= Өз: 
Ìt is easy to show that 
(4) 


F(q) = BG, i) 
2? 


р arc tan (a/v (27) 
: (cos uy du. 
àv) Jo 


COMPUTATION 

IBM 704 computer to evaluate H(t) to 
a table of ЕДІ) values in memory for 
as of the Taylor series expansion of E(t) 
s, herein referred to as routine 1 
e of the H(t) sub- 


3. METHOD OF 


Subroyt: 
a Б pan а first programmed using the 
abo 0-2) 3 P» 2 figures by storm 
ang the tome. (1) 9, then using the first x Ts 
Toutine 9 st value of t in the stored table. Two 
» Were then programmed for the IBM 


routine 
704, both making us 


ж See Editorial Note оп р. 462 below. 


469 T'able of the upper 10% points of the studentized range 


А T ine 
ine already mentioned. Routine 1 was used to solve for the percentage points d 5 
en af ә checking purposes. Routine 1 computed P,(t) from (3) for Ы = | ic, 
s Dia ег (53h these values being stored on magnetic tape for use in (1). Тһе valu 
wish dida ға) = 1—@ were found to three signifie 
interpolation іп 4. The integral in (3) 
formula with u = 31 (0-125) 8 usin 
integrand at the lower limit; the 


integral in (1) was computed usin 
when n = 


ant figures by iteration using ". 
was computed by the Euler-Maclaurin ұлтар?” е 
g terms up to and including the first ees = тё 
derivative at the upper limit was taken to be s ie ofq 
g the trapezoid rule with / = 0 (0:125) 8. The va " were 
2 and v — 1,2 were found explicitly using (4). For v = оо, the таяз о k^ №. 
computed ab initio using iteration and linear interpolation to solvo theequation ғ) 120 4.5. 
where Р (4) was computed from (2) using the rectangular rule with и = Ed Е crude 
Starting values were found for the first column of the table (i.e. n = 2) by using а 


: ase 
i 5 A "a given c 
interpolation between the values found at р = 1,2 and v = оо. The solution of a g 

served as the initial guess for the following case. 


4. METHOD or CHECKING 


ted 
е : ine 2 which compu 
substituting them into routine 2 which с 

P,(t) from (2) usin 


-ed the 
и = — 6 (0-375) 6. Routine 2 aeie 
ation formula with terms up to ptam upp? 
us er limit of integration, the value at и gran 
limit being taken to be zero, the i ion being Stopped when the value of the ov value 
resulted in altering only the first few 


sont 
tion 

Асы А - her ра 
appreciation to Lois Matsunaga. for her Į 


т KS e те a 
Е eeded to produce the final results. үези ны E 
Y Scheffé for indicating the need for the upper 109%, points of the 


TLEY, H, 
Press, En 


ity 
E. iago Unive” 

` 0954). Biometrika Tables for Statisticians, 1. Cambridge U” 
боне, H. (1953). Biometrika, 


CORRECTED TAB Е 
LES о ма 
OF UPPER 5% Anp 1% POINTS ор THE STUDENTIZED R^ 


Ру Editorial note f ә 
Short explanation of the basis of th ио, pas? 
] е ассот nm. ү 
| panying t. 1% P^ i 
pi nie е Seems called for, In Publishing the es % dir n riis oa 0; 
m 8 Joyce Мау’в calculations, Prof, Hartley and T (в; ы ig NU o ees " BRE S p 
able 29, ang Introduction, р. 52) recognized th | iam ee ерт 
to be present, which we hoped would not 
described in th 


t 


were 
of В go? 
his calculation of the 10 % rpo 


e 
s D 
қ m Our last figure, i 80 
sus s Very extensive computation г Г 
esearch Lab 9 being carried out under the direction ^ ohio 
aboratory of the Wright Air Development Center: 


distribution of 


range and stud 
Harter in the 


Aeronautica] R 
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work is being published as а W.A.D.C. Technical Report (no. 58-484, vols. 1 and п). Since this pro- 
gramme included the calculation of 10 95, 5% and 1% points of the studentized range to high accuracy, 
the editors of Biometrika felt that it would be useful to take this opportunity of printing, with Dr 
Pachares’s paper, corrected tables of both the upper 5% and 1% points, given throughout to two 
decimal places. 
ful to Dr Harter for allowing Biometrika to make use of his 1 % results in advance 
of publication. ‘This has made it possible to identify some 30 last figure unit errors in Miss May’s 


original 1% table. Further, in the tables at all three levels it has been possible to include a second 
decimal for low values of v, where both Miss May and Dr Pachares had rounded off to one decimal 
Place only*, Finally, Dr Harter and his collaborators have gone to the trouble of recomputing certain 
values with a smaller tolerance where a 5 in the final figure of their fuller tables left uncertainty in the 


rounding off to two decimal places. 


We are most grate 


An interesting point which appeared in a comparison of the Pachares and Harter rules for deter- 
Mining the final figure in rounding off for the 100 P% point, was that under the former rule that 
Value of @ is takon which minimizes | P(x) = P|, i-e. gives a probability integral nearest to the required 
Value of P, On tho other hand the Harter rulo takes for 2 that value which is nearest to the value 
9f X for which Е(Х)-Р exactly. These rules almost invariably lead to tho same answer, since F(x) 
Will increase nearly linearly for small increments of 0-01 in x. However, in one case at least the rules 
do lead to different conclusions: for the 10% point, n= 12, v= 16 tho Pachares rule gives 4-81 and the 

Tarter rule 4-30; the former value has been inserted in Table 1. Е.5.Р. 
* Dr Harter sk с . however, that where the tabular value lies between 10 
and 100 dsl ion pred venio si ry rre guarantee that the second decimal place is not in 


en 
Tor by one unit. 


464 Table 1. Upper 10% points of the studentized range 


Ж | | 
2 3 £ |o» 6 | 7 8 9 10 
v | | 
| 
| | — — 
| 
1 8-93 13-44 16-36 18-49 2 | .5 22-64 23-62 
2 4-13 573 | 67 T54 | ý | 863 | -05 9-41 
3 3-33 4-47 5-20 5-74 пк ge | eer 7-06 
20 | : 6-81 7-06 
4 3-01 3-98 4-59 5:03 өз { 5-93 6-14 
5 2-85 3-72 4-26 | 466 98 | s 546 | 
6 2-75 3:56 4:07 4-44 | қ қ | 547 
7 2-68 3-45 3:93 498 | 7g. | oat | 25 
8 2-63 3-37 3-83 417 ШЕ Du 
B is i 8 43 65 4-83 1-99 
2-59 3-32 3-76 4-08 t А 79 | 7 
10 2-56 3-27 3-70 402 | È Ё че d 
ae | | 2 4 47 | 4464 4-78 
11 9-54 3-93 3-66 396 | : | | wey = 
14 | 22 | 390 | 5%) gal т е i үзді; 
13 2-50 318 | 350 ve | En 451 | 4-65 
14 249 sam | А $88 | é . 4-46 4-60 
16 3:56 3-85 | | 
9 а T ла 
15 2-48 3-14 3-54 203 | 4-42 4:56 
16 2:47 3-12 252 $6 | : UR 4-39 4-59 
17 9-46 3-11 Er = | Е i | 4:36 4:49 
= 3:50 3-78 | . $ | 4 : 
18 245 SIG 433 | 446 
3-19 3-77 398 | 4.16 з 
19 9-45 3-09 = E 4.31 444 
20 ; 3-47 3-75 3:97 ВЕ +26 9 
pen HE im "es 4.29 442 
"92 4 | 497 & 
24 249 ав: | 2:5 a | i 4.27 440 
2 2-40 3-02 | 339 3-65 | T. ; 421 4-34 
© 2:38 9-99 935 | за | Sar pec E 4-28 
| 336 | 2% $31 gam | 250 4-10 421 
234 | 293 3-98 ойым 1 3 4-04 | 446 
Ф 28 | 500 $94 | 3 ; 3-86 3-99 4.10 
ЕЙ. M | | $81 | 3.93 4:04 | 
| 
КӨ p T en 
11 12 13 
14 
v 15 16 17 18 
N | 
——— | 
l | 2524 | 959 | 
"92 26-54 7 
2 10-01 1 9 27.10 27-6 | 
3 | 749 p 1049 | 10-10 10-89 | Thor | 28:54 | 28:96 
4 фр i. 7-83 7-98 өз | 197 | 1184 11:39 
5 | 59 | ej | ga | бї | тоз | ті | 527 | 848 
Я | М | ^4 La 
6 504 | 5 6-34 644 | 6.5 e dies 
“76 езе в 6-54 6-63 " 
7| 54a es 5:87 598 | 60 3 E | 
8 | 525 | gag | 564 5-74 583 m 6:25 Dod 
2| 513 | sss | gee | 556 | ses | 59 | 599 | бов 
10 5-03 5413 FE 5-42 5:51 | AB 5:80 5:87 
и | 495 | 505 | . | 592 | одо | 541 fen aie 
12 4-89 4-99 215 533 5:31 64 ih 
13 488 | 49 5-08 | 5.16 5.2. wag 5-45 BBL 
14 | 479 «| 59] sm| F| 93 | ss 
15 488 | дут | s | 518 | Beg | ОШ 225 
4:75 4-84 4.93 5:05 5-12 | к 19 5-31 5-37 
16 | ат | 4g SoL | sos | бы | 528 | 532 
17 | 46 | алт | 289 | 49r 5-04 | me d 
IB | 45 | weh | ШӨ) 9885 | gor 91 517 5.93 
19 | 46 | алә em 490 | 4.98 5:07 513 5-19 
20 461 | ал n 4-88 4:95 5-04 510 | 516 
24 | 454 | авз i $36 | aoe | 10 5-07 5.13 
80 | 44 | ав | 171 | атг | au. | 799 | 505 | 540 | 
29 | | £x | T | 4m | фе | бт | dg 5-02 
| 60 | 43 | ао | 256 4-63 4-68 4-83 489 | 4-94 
Fei 25] зо | S | ш 2 | sd 4.86 | 
© юз ge | 14%! 149 | жи | E | Tus 478 | 
| E Nd | 41 4-47 4-59 4-65 4-69 | 
sue он | #6 | 4857 4-61 
le E 


Table 2. Upper 5% points of the studentized range 
о 9 5 
n | | 
ү 2 3 4 5 6 | 7 8 9 10 
| 
prm | 
д | 20-08 | 32-82 | 3708 4041 4312 | 4540 
з | 8-33 980 10-88 11-74 12-44 13:03 
4 5-91 750 | 804 | 8-48 8:85 
5 504 | 6-29 671 | 708 7-35 
© 4-60 | 22 5-67 6-03 6-33 6-58 
7 4.34 4-90 5-30 5603 | 590 6-12 
8 4416 4-68 5-06 5-36 5-61 5-82 
9 4-04 4-53 4-89 5-17 540 5:60 
10 3-05 4-41 476 | 5-02 524 5:43 
ii 3-58 4:33 465 | 491 5-12 5-30 
12 3-82 4-26 457 | 482 5-03 5:20 
13 3-77 4.20 451 | 4-75 4-95 512 
14 3-73 4-15 445 4-69 | 488 5:05 
15 3-70 411 4-41 4-64 4-83 4-99 
16 3:67 4-08 1-37 4-59 4-78 4-94 
17 3-65 4-05 4-33 | 4-56 4-74 4-90 
18 3-63 4-02 4-30 4-52 4:70 4-86 
19 3-61 4-00 459800 449 4-67 4:82 
20 3-59 3-98 4-95 447 4-65 4:79 
2d 358 | 3:06 4-23 445 4-62 4-77 
30 2.92 3-53 3:90 4-17 437 | 4:54 4:68 
40 2:89 349 3:85 4410 4-30 4-46 4-60 
60 2.36 3-44 3-79 4:04 4.23 | 4:39 4-52 
120 2-83 340 | 3-74 3:98 | 4:16 4:31 444 
% 2-80 3.36 3-08 3:92 4.10 4.24 4:36 
9.77 3:31 3:63 3-86 4-03 447 4-29 
" т 
11 із 35 14 15 16 17 18 19 20 
о 
ч. ЗЕ | |. рей | 
1 " ә | 5722 | 5804 | 58.83 | 59-56 
50-59 .06 59.9 54-33 56:32 57:2 2 Б = 
2 | tags | ibis 5820 | 1688 1591 | 1614 | 1637 | 1657 | 16-77 
3 | әлә | 995 | 1015 | 1035 ipi | 109 | 100) Мы 
i | воз | gar | вт | өш s79 | 891 | 903 | 913 | 928 
$ | т 732 | 747 | 7.60 | KE wd] m EO) 
6 6-65 p И каш 1-94 1-34 7-48 7-51 7.59 
Н 6-30 6-43 rear 6-66 6-85 6104 7-02 ез ur 
В 6-05 20 6-39 6:57 oes 8:74 Ба BLU 
9 5-87 6:18 6-29 I 6-36 6:44 | 651 6:58 6-64 
10 5:72 5:98 6-09 Hae | 619 6:27 6-34 6-40 6-47 
ц БЛП 5-83 Bm кет 6-06 6:13 6-20 6-27 6-33 
12 i: 5-71 5-81 5 | 5.95 6-02 6-09 6:15 6-21 
13 1 5-61 5-71 5-80 86 5.93 5:09 6:05 6:11 
14 "ea 5:53 5-63 5-71 | оло 5:85 5-91 5:97 6-03 
15 en 546 5:65 pi | | 572 5-78 5:85 5:90 5:96 
16 5:40 5:49 SP | 5.06 5-73 5:79 5.84 5.90 
17 ae 5-35 5-44 5-52 5:61 5-67 5:78 579 5:84 
18 21 5-31 5-39 5-47 5-57 5-63 5-69 5-74 5-79 
19 517 5-27 5-35 5-43 5-53 5-59 5:05 5-70 5-75 
20 514 5.93 5-31 5-39 5-19 5-55 5-61 5-66 5-71 
24 5-11 5-20 5-28 5:36 е 544 5-49 5:55 5-50 
30 5-01 5-10 5-18 5:25 | 5-97 5.33 5:38 5-43 5-47 
40 492 | 5.00 5-08 5:15 | | SÉ 5:22 | 527 5:81 5:36 
60 4-82 490 | 4-98 5-04 Г Bon 5-11 5-15 5-20 5-94 
120 4-73 4:81 | 4-88 4:04 | 4-05 5-00 5:04 5-09 513 
% 4-64 471 | 478 4-84 | 4.86 4-89 4-93 4:97 5:01 
S 4-55 4-69 4-68 ала MEME P КӨКЕН 


466 Table 3. Upper 1% points of the studentized range 
Р | 
2 3 4 5 6 7 8 | 9 | 10 
р | 
| | 
1 | 9003 |1350 | 1643 | 185-6 | 202. _ 
2 | 1404 | 1902 | 2229 | 24-72 "3 2970 | 2456 
53 8-26 10-62 12.17 13-33 14:94 30-08 31:69 
4 6:51 8.19 9:17 9-96 105 | 16-20 16:69 
5 5-70 6-98 7.80 545 08 1L93 | 12:27 
2 à 997 | 1024 
6 5-24 6-33 7-03 БЕ рат | 10 
7 4:95 5-92 6-54 7-01 8:87 910 
8 4:75 5-64 6-20 {ез 817 8:37 
4.60 5:43 5-96 6-35 7:08 7:86 
10 4-48 5-27 5-77 б 7.33 749 
п 4:39 5:1 1 ің 7-05 7-21 
B | ааа] se bes 2n 6-84 6-99 
13 4-26 4-96 5-40 S 6-07 6-81 
A 421 4-89 5-32 а 6:53 | 667 
417 4-84 5-95 5-56 6-41 6-54 
16 1. ў 0-2 6-44 
E a | |500 | se| > 5. 4 
18 | 407 514 | 543 | 56 5:8: 501 Bm p (20 
Б 410 5-09 peg l 58 . 615 | 6327 
4-05 4-01 5-05 mae | 5- 5-94 608 | 620 
20 4:09 4-64 ра 5:33 “55 к. 4 : d 5-2 
24 | 396 | 4 Hed Me. 3 3 ! в | o 
. -55 ( 5 5-08 
| i| 28 | 4 | ou | „| s | ce | * 
40 3-82 aan re 5:05 D$ | x 6 | 592 
| а um | 5- OBEN ZEB 
120 3-70 4:90 59 4-82 Ы j | 550 5-60 
5o 3-64 419 0 4-71 $ ў : 5-36 5-45 
4-40 4-60 | | : 5:21 5-30 
|]. | CT 
п 
v x: 18 14 15 
16 17 18 19 ш 
жен a | | 
А а 260-0 266-2 кен БЕН 
р 33-40 ) { 2770 : 0 
3 | 1713 | 17.53 En 13 | 3481 | 3543 E^ 8 |2863 | 290-4 | 2943 | 2995; 
4 12-57 19. 89 18-22 18-59 "00 36:53 37-03 37-50 37 
5 | 1048 St | 18:09 | ыз | 1202 | 1881 | 1907 | 1932 | 1955 | 18% 
1070 | 1089 | 1% $53 | 1373 | 13, 4| 1440 
6 9-30 08 | 11:94 91 | 1408 | 1424 93 
9-48 9-6 1140 11.55 11 
7| 855 | sm | age | 981 | 995 | 100 55 | 1168 | 11-81 ji 
8 | 803 | вав в | 900 | оло 999 | 1021 | 1032 | 1043 1р 
7-65 ГА 8:44 8:5 9-35 9- 9:55 ; 
10 78 7-91 55 8-66 46 9.03 
7-36 7-49 P 8-03 8-13 4 8-76 8-85 8-94 51 
п 713 795 » тт 7-81 d 8-33 841 8-49 $23 
D | es | qos | 219 | 746 | zug | „„ | ^99 | 808 | 815 96 
13 | 679 | бу БЫР лов [o 85 | т Я 7.38 i 
14 0 7-01 7:36 7-44 7-81 7 7.18 
6.66 6-77 7-10 74 7:59 75 7:66 5 
15 5 6:87 3 9 797 59 75 
6-55 6-66 696 6-96 7-05 Wa 7-35 7.49 1-48 7.89 
16 | 646 | 6. 684 | боз 3 | T20 | тот | 733 26 
17 6-38 A 6-66 6-74 8 7-00 7-07 7-14 7-20 T 5 
18 | 6.31 8 | 657 6-66 82 6-90 71 
19 | eas | СИ | 650 | 6 97 | eg | ger | To | 70) | qob 
25 s 6-34 6-43 Ба 6-65 ais 6-87 6-94 7-00 6-97 
:19 6-98 637 "51 6-58 ве 6-79 6-85 6:91 6:89 
24 6:09 қ 6-45 6-5 5 6-72 ; 6-84 .82 
611 Bf 2 6-5 6-78 68 
30 5:85 Я "19 6-26 0-69 6-65 6-71 6-77 
40 = 593 | 6401 E 6-33 6-39 6-07 
9 5-76 58 08 6-14 6-45 6-51 6-56 41 
60 5-53 5-60 25d 590 | 59 6-20 6-26 6 6-36 б 
120 | 537 | ga | рш | 5% | sz | 902 | eor | Giz | 616 | бш 
co 5:23 | боо e 556 | 54 RM 5-89 5-93 597 MT 
5-40 5-45 bus 5-71 5-75 5.79 5:65 
5-54 5-57 5:61 
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ON THE DISTRIBUTION OF THE EXTREME STUDENTIZED 
DEVIATE FROM THE SAMPLE MEAN 


Ву К. C. 5. PILLAI* anb BENJAMIN P. TIENZO+ 
The Statistical Center, University of the Philippines and the Philippine 
Statistical Survey of Households, National Economic Council 


1. INTRODUCTION AND SUMMARY 


Тһе distribution of the extreme deviate from the mean of a sample taken from a normal 
Population and that of the studentized extreme deviate from the sample mean are studied 
and percentage points are obtained for the latter distribution. This paper deals only with 
Small sample sizes. | 

eta, 5 Vox... 


of the « < v, be an ordered sample of size n from a normal population, the mean 
e sample and s, 


‚ the square root of an independent mean square estimate of o2 based on v 
egr 56 , Mod 
Ж ү of freedom, where o? is the square of the standard deviation of the normal population. 
һө) чам ын к 
п, the distributions considered are those of: 


- uc(r—x)e ог (2-24) (1) 
" 
М ta = (®„—®)/8„ or (2- ау) [5,. (2) 
Е Ы (1935) derived the distribution of wand indicated its usefulness in testing outlying 
ery 


ations. Nair (1948) made further studies of the distribution and tabulated the 
bility integral of u for sample sizes varying from 3 to 9. Grubbs (1950) also studied the 
4 о problem independently and obtained tables of the probability integral 
Th sample Sizes ranging from 2 to 25. 

Ө distribution of w as given by MeKay (1935) follows the recurrence formula 


wh, nue Jom | iT / la = ) na). ш 


ere, f, x қ , — Е 
in the following Nair (1948), №, (и), the cumulative distribution function of u, can be given 


Proba 
Same 


e form 
л үт 
F(u) = ИП ра G., (nu) 
and 1 т 2 
Where 9да) = | |] ай (Go(x) = 1). 
Nair (1 E: 


1948), following Hartley's expansion (1944) for what he termed 'studentized 


an de › Obtained percentage points of the distribution of t, for sample sizes n = 3(1)9 
6 Sa жер of freedom v = 10(1)20, 24, 30, 40, 60, 120, оо. David (1956) showed that 
entization procedure employed by Nair tended to become unsatisfactory 


) : for 
"= lo "à Provided amended figures, using the same values of v and adding results 


for 


Таъ : 
ulati s 
here a tion for y < 10 was not, however, attempted by previous authors and we shall 


e L А г 4. 
А * Unite % study of the distribution of ün and also t£, for n = 3, 4, 5, and р < 10. 'The 
Stag; еа i A : е 
tatistica] a Pig Senior Adviser in Mathematical Statistics and Visiting Professo 
d аш; University of the Philippines. Now with United Nations, New York 
or Statistician, Bureau of the Census and Statistics, Manila. 


т of Statistics, 


9w Se 
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percentage points for v < 10 are 


ion to 
ication t 
roblem and an M merid ia 
i > formulae 
percentage points derived from the formulae g 
Somewhat more extensiv 


up to = 12 given below in the paper by Pillai ( 


* gizes 
e tables covering sample 
1959, p. 473). 

2. DEVELOPMENT OF CERT 


Pillai (1950) has developed an expansion of th 


AIN SERIES EXPANSIONS - 
"- 5 79 he fo: 
€ normal probability integral in t 


[rea = wetu? 
0 


(4) 
law +... +44 +...) 
where а; follows the recurrence relation 


(5) 
3(2i--1)a,— a, = (- D f3ir( 4 1)]. 


The series (4) has been shown to be absolutely 


van (1954): 
convergent by Pillai & Ramachandran ( 
Now Substituting for w, 


"зау? y qt. 
(3/42) x in (4) we get an expansion of the integral | 


€ Series by 
6 
MAS ius d В (6) 
f edt dm BU ttt. араны.) 
42 
then (2i 4 15, = 


Tf we denote the resultin 


(7) 
ЛАК (— зу, 


ran (1954) that 
7a, < 1130; 


Which shows that series (6) is absolutely 
are given below: 


and hence == 99/601, 


+ ants 
fficien 
convergent, The values of the first few b coe: 


by = 0-295, б = —0-003,287,337 7, 
by = —0:032,149 857, by = 9-000,837,638, 99. 
b, 0:018,080,357, 
3. Tug DISTRIBUTION OF THE EXTREME DEVIATE FROM THE SAMPLE MEAN 
In this Section the distribution of the ex 
norma] Population 1 


is develo 


ped in series 
3. From (3), falu) 


m forn — З, 4 and 5 
can be obtained by putting n 


8) 
3/3 % ( 
fa(u) = “Pete f “ед, 

п 0 


Transforming * = t[J2 ang using the series (6) in (8) we get 
9/3 
ТЕТЕ 


(9) 
pes SPHERE e 
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" In f.(u), if we retain terms up to and including 6; ий and approximate the remaining 
erms in an exponent, we get 


( 


қ Dials " x " 
Лаби) = СУ ec (w+ byu” + bu? + byu? б ил езгі), (10) 


The ср.р. 


ee. of u can be obtained from (10) as a series of incomplete gamma functions, and is 
1 lven by 


mo AEST C: : е a 

(0) = E73 М I heit 1) P( +1) 36-91, (6) гө) (11) 
Where "х 
уеге r= 3-0/6; and (р) = | e iw? di[V( p 4 1). 


J0 


If w а — a5 
We make U tend to infinity in (11) we get the total probability. The total probability 


1 give ed for (ао) in (11) is 0-999,262. For the 95 and 99 95 probability, Grubbs (1950) 
of bao. bod and 2-215, respectively, which when substituted in (11) give probabilities 
accuracy ie and 0-989,882; hence the expression (11) was considered to give sufficient 

трча " үзеді. the upper percentage points. 
54. Lo obtain у, (и) we substitute n = 4 in (3) and use (11). We get 


8 
\/(бл) 


езі p (su). (12) 


Лии) = 


(и) іп (10) by parts in the interval 0 to $u and substituting in (12 we arrive 
УТ 3 5 


Jalu) = Afe- [p + eS (co си? +... си + e"? (es Heut... е,и19)), (13) 


Where cM 
= £571,541,8, b= 0-402,768,92, d = — 3-786,324,8 and 


f с; 4; 

0 — 0-409,523,81 0-006,754,891,7 
1 — 0-203,174,60 0-021,072,953 
2 — 0-270,899,47 0-032,807,204 

3 — 0-030,099,941 0-034,181,266 

4 — 0:060,199,882 0-026,658,466 

5 0-016,633,060 


Bratin T 
ена в (13) in the interval 0 to U we get F(U) which could be shown to reduce to the 
orm Involving incomplete gamma functions: 


kyu 6 
aU) = 5 қ з В 
5 ) ВГл) Г) € € У filholi -p Г 4D x 9:1ара(— 4) 702—4), — (14) 
“гө В i-1 i- 
7 1127,546,0, 0 = 1-523,847,3, D = —0-002,141,004,0 and 
k Л 9: 
1 — 0:336,458,14 — 3:106,164,5 
2 — 0:050.077.491 — 3:053,611,7 
3 — 0-020,030,996 — 1.257,977,7 
4 — 0-000,667,699,88 — 0-345,494,26 
5 — 0-000,400,619,93 — 0-071,165,578 
6 — 0-011,727,059 


Extreme studentized deviate from the sample mean 
4'10 


ints 
bility obtained from #, (0) is 1:000,061, and for the 95 and 99 95 poin 
dr жарға көр we get the values 0-950,062 and 0:990,063. 
i u > А чы 
ane y а From (3), putting n = 5, we get 
iii) n = 5. 


: "EG 15 
Л) = У ere pp) (15) 


ш) in i i E rive at f(u) in the following form: 
i 13) in the interval 0 to ти, We arr 5 
Integrating f,(w) in ( 


Л5(и) = В [hu из + hus + hyu] ; 
u 
ULM TENET + iyu? tigu +i, u?]4- Fete | e? dw 
fu 2f 
taetae f eT" dy 4 H eie ete? dap) | 
0 0 


Е = 0-402,768,92, 


0 
(16) 
where 2 = 10-195,247, 


G = —0-463,009,52, H= 0-013,920,354, 
a = —6-541,132,5 and 


4 hi p" 

0 0-066,857,143 — 0-008,956,828 
1 0-099,867,725 —0-021,607,151 
2 0-042,713,845 —0:031,069,928 
3 0-043,058,311 — 0-029,234 091 
4 


— 0-016,364,940 
4. Тнк DISTRIBUTION О 
The distribution of b = (%,—2)/s, or (—2,)/s, 
in the previous section and F(s,) given by 


F THE EXTREME STUDENTIZED DEVIATE FROM THR SAMPLE MEAN 


could be obtained by using f. 


n 


(ш) derived 


Ў(8,) = ЛЫ (% 


v)] $1 eds? 
Since s, is independent, of u, the si 


(17) 
multaneo i 


Fws) = Ішу (s,). 


In (18), if We use the transformation 


i= uls, and 5, = 8, (19) 
: Let us Study the distribution of ts, ts 
> applying the transformation (19) with n = 3 and 
Set f(s). ft.) сап be shown to be of the form 
f) = вузы Г Al 3 Meu wga 

: байы 2 Mona 
"Аы (ыы) (на) 


b 2 1o. 
вм ШЕ) (20) 


іп ӛһе interva] 0 to co we Bet f(t.) 


(i) n= 3, Multiplying (10 


bs 
2 »)- 28 9 
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Ж Terms involved in Ді) сап be reduced to /-functions since ty varies from 0 to oo. Hence 
пе C.D.F. of /4 can be given in the form 
Ры) = K | X Tri EE D977 [Poi yea +6) Гө 1 - 6j do 
а а е A 
(21) 


where К = 4:961,960,1, L = 1,868-775,6, 
p= 3-509,615,48/(v + 3°509,615, 415). 


І 


a = 3t$l(v 38), 

ta arriving at (21) the transformation used for all terms except the last one in (20)is 0 = о, 
and in the last term of (20) is 0 = £. 
Alternatively, for odd values of v, it is possible to obtain the distribution of t separately 


for each v in a convenient form involving f-functions. Тһе distribution obtained in this 
multiplying f(u) and f(s); using trans- 


m mi А 
anner is exact. For example, when > = 1, 
espect to s, by parts, we get 


formation (19) with р = 1 and integrating with г 
fiat) = Зе + 2) (62 + ИН (22) 
6/(1+68) 
ни Falls) = BDM] | (1—0)-3 (1 — 36/4) d£. (23) 
Jo 
The other cases also could be attempted in a similar manner. 
(i) n = 4, Similarly, using fa(%) given in (13) and f(s,) in (17), we get 
1 2+1 5 бей "m 9%--1 
Л) = 44 Ln Y оре) г) + У е(2/7-572,649, 6) T| 737 
Б " ЕГІ ы i=0 Q4) 
ала у {kv 1)) | ™ giai — 01 do 
Fi.) = мә | 6-Ң1-0) 
(14) = Mb T») A | 
4 е Tfa(v4- 2i 4-1) f” лего (1 - 0i 0 
+N У с100-3)% +9 m 4 gawu ) 
1-0 2 Я 
5 Iv 3i 4-1) Г giei-» (1 ya 46, (95) 
E 0 P MEN eee 
+N X «(—@) ndn Г(%») 0 
i-0 


N- 2-285,770,9, m = 413/(30 + 462). 
2,649, 6t5/(P + 7-572, 649, 64). 


r to that adopted for fts). 7%) for 
as well а8 the corresponding C.D.F. S. 


different cases. 


Wher, 
"bie М = 2-799,486,2, 
p = 7:57 


Wes 20/2/(3v + 20), 

10) n = 5. The method used for this € 
e ia. 5, 7, 9 and 11 have been evaluate 
expressions for the c.p.r. are given be 


fis. dies | 
(;) = M s Га Ж oye» = 56[8) 49 


ase 15 simila: 
d separately 
Jow for the 


450 б 
+ юс» к. ejes 250/287 “240 
i-0 * 0 
ve p " 9 
ce Диа желінді (end 
- i = . (26 
+ 3 1, 9 | $ келемін (v= 1, 3, 5, 7,9, 11) ( ) 
2и] (1 oye» oidó 2,74), it i" 
i iyen in Tienzo ( 


Th 


e Values of ‚8 


[АГ 


Гала, М, are 6 


т, U; №, bys M v'i 


їр 


419 Extreme studentiz ed dev ate f от, the sample mean 


5 PLE MEAN 
у EE HE SAMPLE № 
5 ER > МЕ IDENTIZED DEVIA E FROM T acne 
PERCENTAGE POINTS FOR THE EXTREME STUDE TIZE VIAT y М 
| ^ " d lfr m the formulae for I (ta) giv 
ta ints for n = 3,4 and 5 were computec ГО 

Гһе perce ge points = ә, | ч i 

x revi E ction. For = 5 and even degrees of freedom, the 

p ious se Я act 

' erpolation. For n = 2 the values were conr 
i Ola š 4 i қ 
oa ntage points of the studentized range (F 
bercentag: b» е 

: por ated in th more extensive table of 5 
cor e 

printed on p. 473. 


alues were obtained by 

тез ling 
puted by dividing by 2 corresponding 
illai, 1957). 


d en in- 
All these values have been 
and 1 9 


; = 12, 
points for sample sizes up to n 


6. APPLICATION OF THE TEST 
Cochran & Cox ($4-23, 1950) have given à 
carried out by the North Carolina tural Experiment 


: - 36. 54, 
ment tested the effect of five levels of application of Potash, supplying respectively 36, 

72, 108 and 144 1b, К.О per acre, on the yield and Properties of cotton. The 
arranged in three randomized blocks of five plots each. The 
respectively 7-85, 8-05, 7:74, 7-51 and 7:45 with gener. 


š ‚< experiment 
an example of a randomized block expe 


Т ерегі- 

| station in 1944, The ex] 

Agricul Station in | 

experiment was 
у 1 . Y геге 

means for the five levels w 

al mean 7-72, 


Analysis of variance 
Source of variation 


D.F. 5.8. M.S, F 
Replicationg 2 0-097, 1 = E 
Treatment 4 0-732,4 0:183,1 419 _ 62 evel 
(Significant at. 95 Yo at 
but not significant 
the 99 % level) 
Error 8 0-349,5 0-043,7 = 
Total 14 1-179,0 — = 
For testing the difference of the largest mean from the general me 
Referring to Table 1 on p. 473 is i 


alue of t; 


e 
is 2-24, Referring to th 
5% level. 


— 
Statistical Center, Unive 


aper 
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Шрртев. 


Upper percentage points of the extreme studentized deviate from the sample mean 


Bv K. C. 8. PILLAI* 
The Statistical Center, University of the Philippines 


Let ау < Ta 5... < т, be an ordered sample of size n from a normal population, %, the mean of the 
sampla and s, the square root of an independent mean square estimate of o? based on v degrees of freedom, 
Abre G?*isthe square of the standard deviation of the normal population. Then the extreme studentized 
каме from the sample mean is defined as £, = (c, —3)/s, or (z—25)/s,. Table 1 gives the upper 5 and 

% points of 4, for sample sizes ranging from 2 to 12 and degrees of freedom from 1 to 10, thus filling the 
Вар that existed in the tables of upper percentage points of t. 


Table 1. Upper percentage points of the extreme studentized deviate from the sample mean 
(x, — T) is, or (£—23)/8,, 


te 
"n | | 
| 
2 | 3 | а | 5 6 7 8 9 10 12 
v | | 
| | 
waa, — 
5% points 
ж" 
| | | 5 E 25 26 
2 И [E pu | ч r^ p^ p^ Т1 75 
3 С Foe be | Pis | 436 | 455 | 472 | 486 | БП 
4 ШЕТІ | 956 | s4s | 365 | 380 | 393 | 405 | 424 
96 | 258 | 298 | 326 | 348 | а de 
а L82 | 237 | 271 | 295 | 315 | 330 | 343 | зы | 364 
| Ой ай 31 | 331 | 3:39 | 3.54 
> | wm | see | sss | zoe | 205 | 20% aoe 315 | 323 | 337 
8 HET | jig | Se | 959 | PE | eis 295 | 304 | 312 | 335 
9 1:63 2-09 2:97 | 2:57 | 272 es aie 2.96 3.03 3:15 
10 160 | 204 | 232 | 251 | 265 | 7^» | оз | 289 | 296 | 308 
1-58 2-01 2.27 | 946 | 260 | 272 | ^ 
| ки quM жәе E 
E | 


1 9$ points 


| E 3 | 119 123 130 
5 45-0 68 | 82 93 101 108 МЕ > 156 |161 16:9 
3 7-0 9-9 11-3 | 12.6 | 13:6 | 144 ed 84 86 90 
: T- "sn Я j :68 
4-13 55 | 63 6-9 7-3 n ў 622 | 639 | 66 
5 326 | 423 | ам | 523 | 554 p Pt 526 | 539 | 56? 
2-85 3:65 | 41l 4:45 | 470 " - 
238 | mes | в „ | ga | 28 | 555 en 446 | 463 
7 ы 22 | 3-32 3-72 402 | $94 | 41 4-25 13 | 422 | 438 
8 "48 3-11 348 | 374 | 394 | | 402 | 6 2d 
9 237 | 296 | 331 | 356 | 374 23 | 386 | 395 A | - 
10 2-30 2-86 319 | 341 | 359 4” 3.73 | 382 | 390 
ч; 2.94 2-78 3-10 3-32 3-48 
* ina a 00 == == rofessor of Statistics now 


tics and Visiting P 


ы Unit, eos ratte 
Т. th is Nations Senior Adviser in Mathematical Statis : ncluded for the convenience of 
% The on Nations, New York. 10 are not new but are include 
table has з for n = 2 and the rows for » = Biom. 46 


3o 
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as in i f 
== m i i z taking one-half o 
ы 5 f trial values was obtained by ta 1] 
three steps. First, a table o y v depen 
Table 1 w: possem, e points of the studentized range, q = (x, —2,)/s, from Pillai | usi a 
i 6 аа iix = вач а noting the differences between them and the percentage | 
justin; es ter £ 
реттер - David (1956) and Pillai & Tienzo 


Secon: i з and 
(1959). Seco dly, based on the table of trial values an 
у: vid" lving the equation 

employing David's method (1956) of solvin, q 


© 1) 
| fus) | -— f(s,) ds, du, = а, ( 
0 0 


- ne 
and u, the extre = 
2, 4, 6 and 8 and values 


5 jf the 
vo neighbouring values from 

е same value of n. 
Iam grateful to Professor E, S. Pearson for his Suggestion of fi 
ienzo, 


Г ла 
lling the gap between the table of Davi 


Aurora Abesamis for the computational assistance 
knowledge the facilities offered by the Statistical Center in the 
Davin, Н.А, 
[xe А Concise Tables Лот Statisticians, 
Philippines, 
Pura, К 


: А r the 
The Statistical Center, University of t 
(OS. & TrENzO, В. P, (1959). Biometrika, 46, 467-72, 


in preparing Table E i 
preparation of this pape 
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(1956). Biometrika, 43, 449, 
GRUBBS, Е. Е. (1950). Ann, Math. Stat, 21, 27. 
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hi 


[ 475 ] 


Tt 2 
he asymptotic efficiency of th Xr-test for a balanced incomplete block а ig 
e esign* 


By Рн. van ELTEREN axd G. E. NOETHERT 
Mathematical Centre, Amsterdam, апа Boston University 


Friedman (1937) has sh 
Servers’ T а Hias own how z treatments can be compared on t! i А 
ie, the ecu ry unt extended by Durbin (1951) to cover а M i 2. 
wit h any other treatn [ - E er es ken t reatments exactly once and each ney cei i Mieten i 
(in the: sens БЕ Bi nent exactly A times. We want to find the asymptotic (7 — oo) relati 3 compared 
s itman, see e.g. Hannan, 1956) of Durbin's test with respect to the ne ueri 
alysis o 


variance tes А 
елан test fora balaneed incomplete block design. 
© note for future reference that 

l(k— 1) 

.-1” (1) 
Theis ы 1 i eatment is used (replications). 
for the у ва ic relative efficiency is most easily obtained with the help of a formula given explicitl 

ave, Жы. ИС ime by Hannan (1956). It may bestated roughly in the following way. If both test satiation 
derees ers the alternative hypothesis, non-central y-square distributions with the same number of 

reedom, the asymptotic relative efficiency of one test with respect to the other test is equal 
Iternatives have been set equal. 


to the rati 
а. of the two non-centrality factors after the а 
ially, then, all we have to do is to compute the two non-centrality factors which we shall denote 


yd? 
Ried, Ше tho rank and F-tests, respectively. The conditions for the applicability of Hannan’s 
conditions т shown to hold provided the underlying distributions satisfy some very general regularity 
in Several cli ae details will not be presented in this paper, since similar considerations have been given 
Beforo.c jer papers, e.g. Androws (1954), Benard & van Elteren (1953) and Bradley (1955). 
computing the non-centrality factors, we have to specify the mathematical model which we 


are ga; 
€ to consider. The usual analysis of variance model suggests the following approach. 
i (а Mis be a continuous cumulative distribution funetion with density function (т) = F(x). Let 
É L= 1,2,...,m; y = 1,2, ...,n)E be the chance variable associated with the observation of the иһ 


server 
ributios vus in tho analysis of variance) on the yth treatment. It is then assumed that the dis- 
mE (x "arem i : 
vie) of Xp is given by FG) = (+0, +9), 
o may assume that 2,0, = 0. The 
У H 
o considered specify that for а giv 


Where 1 i 
lis the total number of times a given tr 


nu 
null hypothesis to be tested is that 


Where, w: 
©, without loss of generality, w 


боз б, 
tions L =... = 0, = 0. The alternatives to b 


en number of replica- 


6, 
0, = On = 


When 
© the 4, are given constants satisfying 
Уд, = 0. (2) 
For thi К 
is model, we find (ө.д. Anderson & Bancroft, 1952, $19:3) that 
d d =10 %0 = © Xo (3) 
Өгө g2 i р қ 
0° is the variance associated with F(x) and 
da =. (4) 
he epi с иеп forl Fh imatel 
tctency г Я Я с ;gn. The fact that for large mM, as approximately 
у factor of the given incomplete block design я ах ta hod given in Andrews (1954). 
rank which the wth 


from an argument si 
llowing notation. Let y 


g that he considers the 


e Mathematical Centre, Amsterdam. 
the Office of № aval Research. | 
зе bold type to indicate chance variables. 
30-2 


А stand for the 


On.oe. 
y Піта] y-square distribution follows 
yth treatment), 


or 
о Comput: 5 
bserver a puting dÈ, let us introduce the fo 
Ssigns to the vth treatment (assumin 


and let R, = Уи» 
Е 


ж Report SP 63 of th 
T Work supported by 
ў Here and below we U 


Miscellanea, 
476 


ге then sot 
timent р. If we t 
tion over those l observers who have rated troeim 
™ denotes summati вн 
scu C Wen Durbin's test statistic is, except for a constant factor giv y 
u, =R,- ЩЕ+1), ne 
X = AMEE] 24 


м arge nira! y-squa: wit! — 1) degrees of freedom. 
istributi В (1 — 1) deg 
i y -Ce l y-s quare distribution 1 
i 1, has approximately а non-cent; 
hich, for large l, 
We then have 


u, 


422019 тн 
шеттету (u,)} 


GR) = Уо у 
Bow is z Хи 
y the yth observer, Since 


ki > Xp xy} = fre F Oxy + Myf (4-0, 4- 7,04 


ӛ%ш-6; d | ter^ | ах) dz, 
КЕЛЕ ЕТТЕН Жа) 


and since every treatment 


(8) 
Pix, ханы, 
where j() refers to the treatments rated b 


P{x 


in view of 
occurs exactly A times together with every other treatment, (5), in vie 
(1) and (2), becomes А 
n 8-6 Uk+1) nà Ж 
ER) AX [1,59 Е BT Ot | Ја) аг. 
18777 2 
744 
пл " 
Thus elu) ~ “2% (=) dr, 
and, йла] dia Due dy Ра) а | Sop » 
iar (54-1) (n — 1) ғ" 

From (3), (4) and (6), the asymptotic relative 
found to be 


2 Р test i8 
efficiency E, of the Xr-test with respect to the 2 


12k 2 (7) 
а) 


Y replacing k by n in (7) 
‚ (7) becomes 
3k (7 N) 
ЕУ )= , 
л(Ё+ 1) 

Some values of which are tabulated below: 

k 2 3 4 5 6 1 8 9 10 17 "^ 

EAN) 064 ong 076 080 age МА Ой Gas ös nos d 

When k = 2, we have the case of Paired com 
two treatme: 


nts. In thig Case, (7) becomes 


2 (8) 
Е, = E [ча] А 
and (7N), 


; sed | 55). 
10 efficiency of hig Т ӘУ Bradley (1955) 


"ano 
? rian’ 
ng our notati test relativo to the analysis E ie D 
Notation, n ig ийке 
erence between Bradley 8 result and Our own result, N) cae the hi cd Bann assume i 
one way classification for the analysis of ith l'observation thers к antes of the anit 
З s 88 
within Бос variability diede model eon ered Since the Variance g2 in (8) clearly refers 19 
If 2 > в More а; 1 pr 
2 де хаар Bradley Sresult by 1/C, where I8 given by T a. 6 : ffective numb 
plications of the alanced incomplete block дез; 1 а 8 шы 
far as the Xr-test ig Conce: ; instead of the cid di S 
wlz) = р 


——— 
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Sb А А 

could have considered the more general model 

"—— Fax) = Е (2 4- 0,), 

iere, , the F (x y à А oUm жі 
gained a n i. ch ale i a: ое distribution functions. Nothing much is 
à £ rested in an incomple ^k desi i i 

шш iade. cool жене ыса ры H plete block design, sinee the non-centrality parameter 

particular block 1 of the Ar-test, w ill depend on which particular А treatments are put in which 
Моск. However, in the ease of Friedman's test when Ё = n, easy computations show that 

=n, j ow tha 


db 220, Ў 2 Ec 
Pe aima 2 fide | >,%. (9% 


where / (т v 

fatx) = Е’ (x). (9) сап be us i 

: n ғ). (9) сап be used to compute the asy n i ү i 

ы neral те I he asymptotie power of Friedman's test for this more 

“с "m < ч $ 
"Se. =n = 2, the y?-test is equivalent to the 
Te ioroughly investigated than those of any other distributio 
E totic efficiency of the sig st ri ivi | i i 

м ) sign test relative to the ż-test, а uantity which i zgi 

& Lohmann. 1060) m i екі 4 y which is usually given (e.g. Hodges 
E,, = 40350). (10) 

| est while (8) refers to the two-sample sign 

h is rarely made. (10) сап be used for the two-sample case if f is interpreted as 

the two chance variables under consideration. From a practical 

ieney in terms of the individual distribution of x 

sample sign test is concerned, it is, of course, 


two-sided sign test, a test whose properties have been 
n-free test. (8) then becomes the 


са = is that (10) refers to the one-sample sign t 
the densit mM Ship 
point ies si i. x -у. where x and y are 
and y. Т n w, it is more important to know the еше 
күрт да n answer is given by (8). As far as the two- 
The dna si 2 direct ly without reference to the gr test. . 
arn ты D ы sata in this paper are asymptotic, i.e. valid for large numbers of replications. Nothing 
in tho mb s SHON pig the relative efficiency of the Xr-test for small numbers of replications except 
james € jw n = 2. Sinec іп this particular case the relative officieney for a small number of replications 
ТОНЫК н зе much higher (in the ease of normality) than indicated by the asymptotic formula, it seems 
for the ев о assume that the asympt otic values derived in this paper ean be considered minimum values 
rresponding efliciencies. 
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xpens din. i i ў shieved at the 
i i bias could only be ac à 
xpected that this reduction in 1 i Aree 
It might, perhaps, de NR = ee ce. However, in his 1956 Баран епо ыс hos 
correspon: gu еаѕ an: 
: n 5 in variance is of small order in ne 
suc! 


i ESI rpose of the 
ompared with the variance itself. Тһе purpose o 


as far as n-?, Later 
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ment in the accuracy of est; 
commonly employed. 


2. Suppose 2 is a 


normal variable wi 
E(x) = 1, and let z = 


th variance О 
1—£. Denote V(x) 


(n7). Choose th 
by h. For su 


е t 
e units of measurement so tha 
fficiently large n 


we have 


E(x) = 144 T 312+ 15434. O(n-1), 
Similarly, E(z?) = E14 26-E 3E?) 


= 3h44 15424. 10543 
From now on У 


2) 
+0(п-1). ( 

? neglect the terms of O(n-1). 

Suppose that r = ylz is à ratio estimator of P = Ey) (т), where tho ге 

the form У = C f/z--u, апа where 


Қ.” f 
gression of y on x is linear 0 
Er а (а-1) 8, Usin 


O(n7!). Then р=а+р. Also 
g (1) the bi. 


uisa constant 6 of 


which is О(п-1), 


3 
alh + 342 4. 1543), Ж 
Now r = B - (a +ми)/х. Consequently 
Е" в) = (a? +8) Efa) (4) 
, ш (988) 1-35 4- 16624. 10515 
using (2). From (3) and (4) we find à; 
V(r) = (№ 4- 8h? + 6953) +014 3h 4- 15h34. 10543), " 
3. On Splitting the Sample into two halves we hay i i = yate 
е the ratio estimators r, — , Гау and ry = Уз 
Where y — Қау) and x = (ш +2). 8 fei m 
Lo 125). Suppose that y, = оф = Tu, W 
Е(и, |.) = 0 and Z(uà | x.) = 200; 1,2). Then u = Қи pi ү а ка 
'uenouille’s estimator is T 
t= 2r— Mr n) 
2 1/1 H "bu 1 
ае 
© 2m | = 2 t 
Since (шу) = (ж) = op, We find i i x 
(р) = 14 644 ШЕТІ ив "РУЗ in (1) ana (2) Бауы 1495.4 1912 4 1901 ап 
2 1/1 1 
Е@—8) = on ft —+— 
ҒАС Ты 
Thus the bias is (6% 4. 9043), 


=a(1~ - 6 
which is O(n-2), i She 9023). ( ) 
Now аі Jr 


4 9 qj 
—— = НЕ E: 1/1 1 9 
ж 245 ft. E 8) ret) 
set. ln 1| 7,1 
mit. 
T а% 4 а 22) 75 D 
=4( Е 
= 1+5 g 168/2, 
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и че)? я TX 2 
= ( 14 ‘| = к =f РЕ: 
Hu d аз 4i? a mm 


= 20(4-+h-+4h?-+ 543). 
E(t— B) = 921+ h — 8h? — 16873) + (1 + 2h + Sh? + 10819). (7) 


Also, 


т 


whence 


Consequently, 
From (6) and (7) we find 
V(t) = V(t— p) = a*(h + 4h? + 1213) + 6(1+ 2h + SA? + 10813). (8) 
Comparing (5) and (8) we see that in spite of the fact that, for sufficiently large n, і has а smaller bias 
than >, V(t) is smaller than V(r). 
4. The error entailed by using the first few terms of the expansion of (1—£)-1 in place of 1/2 is 
Ша (1+Е+... +2) = @/х. The expected value of this is approximately 105. The error in the corre- 
Sponding expansion for 1/27 is 1/22 — (1 +2ë +... +887) which has an expectation of approximately 


945/4, Now h is the square of the coefficient of variation of v. Even for a coefficient of variation as large 


as 2095 wo have 94541 < 435. Thus one would expect the errors arising from the use of the expansions 


to be small even for quite moderate values of n. 


regression of y on ж has the same form as has been stipulated 


5. Let us now consider the case where the 
a)/T (m). It follows easily that 


above, but where а now has the I-distribution with density «™-1 exp ( — 


1 1 Е y" 1 
s() T m-l’ 2) (m—1)(m—2) 
Taking ж = (xı +v) as before, and supposing that 3v, and ја, are independent Gamma variables with 


Parameters 1m, we have " 
T 1 IY. =й 
(2) = me’ z(a) “agma oO 


Ti 
E(r-B)— Sul" Tie =p) (m—1)(m—2) 


m 
Thus we have 


с AL ©... (9) 
onsequently the bias ofris must a тт- 1)’ 


and the mean square error of r is 


a 2 
M(r) = E (-5-0) 


p a? 
= Б 8) BrP) + 
1 2 H mo 
= ets 0-2) mm- Пия) (n-Dn-3 
8 (10) 


a*(m + 2) 


> mm —1) (m—2) tm- 1)(m— 2) 


Ав before, Quenouille’s estimator is 
$ ari i ане тана), 
i pes [- nz) т т 9\а 2% 


т 94а Ta 
о(т-3) 
EN. uo e 
"енй 50-0) = (т 1) (m—2) 
2a (11) 
b ү" 
Thus the bias is 7mm 1) (т 2) 
y.dy 211 
1/1 1W [5,1 1-55) 
Ав before, E) [+ ait ag) amm 
т ад 4 
2 2 1(1,l -5 
and s [55-562 E iss 
% 24 2% 
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1 7 | 
4 es * жы 9)2 
E(t- f) = “(> 1) (m—2) * 3m —8) (n—4) 2(0n—2) 
Thus 


2 1 2 | 
9 ax 2 
la 1)(m— 2) TE 2)(m—4) (m— 2) 
a? (m? — 8m + 19) n S(m*— Tm4 18) E 
~ (m=1) (т 2)? (m — 4) (m—1)(m—2)? On — 4) 
Consequently the mean square error of ¢ is 


8 Р 
Mit) = (в): pq. gy а 


т? 
97 (m? — бууз 4. 12m + 16) 


6(т? — 7m + 18) И (12) 
= is - -20(m—4) 
т%т- 1) (m — 9)? (т e % 1)(т 
Comparing (10) and (12) we have — ы m | - 
Mie) — Mie) = mim—1j (m): may + i -1)(m— 2j? (n — 4) 
Thus the me: 


is certainly less than that of r 


ille’s estimator 
3uenouille's estirr ple 
result for any ват For 

; > size. 
m smallness in sample siz ight 
'actieable and m Ей 
кре аз! 
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" erro) 
terms of mean square 


or 
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Course, an estimator which has both s 


Y preferable, from 
est to quote the results obtained by @pplying the method to samples of two ther 
the Population of four pairg (x,y) used by Goodman & artley (1958 to compare a number of 0 
types of. estimators of the Population mean of the y’s, The Population valu, 
uenouille's estimator Yq апа the four estimators 
mean square errors, ате: 


3), (6, 10), 
es ато (2,2), (2, 6), (4, 6), (ы ан 
Considered by G ап and Hartley, together 


Mean Square error 
x cc 3(y, y.) EA Vs » 
dl e аға, в 
s Tant Уз 
y= 3 Pw 0:917 
257 СА Уз 
шет Е is 2-407 
^u [luy КАЕ: OA % 
y in ҒА E [ыш ata) 0-563 
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On the probability integral transformation 


Ву C. L. MALLOWS 
University College London 


1: Шер, d. ...,4 
dependi р 
pending on s parameters, where 


Ка,.б.0..2.0)>0 (av < 0). (1) 


If the parameters are known, the probability integral transformation defined by 


i; 
= | Це. ®Ө„б„-..,0)ай (a & x; <b, = 1,2,...,п) 
a 


defi қ ; ; : 
efines n random variables 21,24... .2% which are independent and are rectangularly distributed in 


(0,1). However, if we estimate the parameters by means of functions 


Flag; n) Falt etn) o ees FG) 


of — . ^ ; қ 
f the observed. values (whieh funetions we may assume аге not funetionally dependent), and define 


? new random variables by 


т 
i= Í S(t. Еу, Fo. weep Dt) dt 
a 
= gto Fo Fe -ӘҒ,), вау (ac b, i= 1,2,...„т), 
be rectangularly distributed. 
joint distribution of the y’s will 
e values of the parameters 


thon in general these variables will not be independent, nor will they 

2. David & Johnson (1948) have remark 
depend on the form of the distribution of the 
in that distribution. Barton (1956) discussed tl 


the null hypothesis is composite, and Chernoff & Le 
related problems with reference to tests of goodness of fit. In these cases, the typical result is that the 


asymptotic distribution of the criterion considered is that of the sum of a y? variable and one or more 
Weighted squares of unit Normal variables; there is a partial loss of degrees of freedom relative to the 
Саве where the parameters are Barton shows by example that the amount of this loss may 
depend on the form of the null law, the particular estimators used, and even on the values of the 


ed that the form of the 
a's, and may depend even on th 
nese results while investigating Neyman's y*-tests when 
hmann (1954) and Watson (1958) have considered 


known. 


present note we show that the pheno- 
funetional constraints between the 
& Johnson conjectured that the 
Tt will be shown that only in 
does not correspond to 


Parameters. 

As a contribution to the study of the small-sample case, in the 
menon observed by David & Johnson, 
V's, is not at all general. On the basis of particu 
rank of the Jacobian J (see (2)) will in general lie be 
Special cases will the rank be less than n. Notice that this los 


that in some cases there are 
lar cases studied, David 
tween n—s and n. 
s of dimensionality 


the loss of degrees of freedom mentioned above. 
3. Let io = goi КЕ.» F,) = f(t, Fo en Fa) 
0 
dy sri бына P agi Ps „>В, 
ЗЕ, fir 
um. Ww 
Жазы, 775 Io 
for Qd gs Pe dcum 
(by (1), gio > 0 fora < x; < b). Then we have 
Mi got Ха Ғы @= b^ ms 
ӧл; тті 


Тһе Jacobian matrix J of the transformation is given by 
n 

(yas Ин) = TT go +91, 

latoon] dl 


(2) 
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I,, is the unit n x n matrix, G is the n x s matrix (Gir) and Е 
where I, is sone 
partitioned (n + в) x (п + з) matri: (^ am n) 
А = 5 


0 1 


ix (F, ‘onsider the 
is the s x n matrix (F,,). Consider 


р 
where I, is the unit з x s matrix. We have 


A(T) = pL, - GF) = p(A) — s. 


In 0\ үт, GY[IL,4GP “tT ee °) (% x) 0 |е, say. 
B 5 1.) (0 2, о 1) \-r)\o 


7, 6 І,-ҒС а 
АП the multipliers on the left-hand side of this equation are non-singular; hence 
7-р() =n+s—p(A) =n+s—p(B) = 8—р(І, + FG). —— 
Tn general, 1,4- I? is non-singular, and so P(J) n; only when some special relationship иа j doos 
f and the estimators U^] will p(J) be less than п. We remark that the form of this relationshi] 
not seem to have anything to do with maximum likelihood 
Ifp(J) = n—8, we have Ю(1,+ FG) = 0, i.e. 
I,+FG=0 , 
and conversely. Two lines of development for this case are now available, Wo may study either: \ 
Problem (a), to find which forms for f(v, 0,, ез 0,) satisfy these в? 


p 5 Fp; ог 
to inquire what 
to satisfy these е 


4. Problem (a). Suppose s — 


2 S wS 
equations for given estimato 
Problem (b), estimators Жі, 


quations, 


2. For p(J) to bo n= 


а А i.c. given Gir) 
s P, should be used in апу given case (i.c. giv 


2 we require 


т gu OF, п у оп 
а, » B a 
із1% €x; 1517 Ox 
А ї=1 io Ox; 1519 Ox, 
Now if, for example, 


=. 12 
Шы Psem У ив (v 2 nor n—1), 


т-(а,-2) 
v 
we find the most Beneral forms for Gaand G iz are 
| 
Ga — ~1; Фа 2-2 
io io 25 
In order to find the most genera] form for 2, we must now solve the Simultaneous partial differential 
equations T 
og ёд 
ME = 0, 
Ea % ёд 
20, / x ag, = 0, 
the genera] Solution of Which (Piaggio 


g(z, бб) = һ 


2—0, ү 
770; (h arbitrary), 
For the case в = 1 we find in a Similar 


fashion the Solutions 
Pls ез 
r g(z,0) = hipo) (=— 0)) (Һ,ф arbitrary), 
= ад Үз): glz, 0) = ^($(0) Q^ (ar) —-0) (h, ф arbitrary), 
1 n 
те т =7(2 
nds (а-а %а,0)-) (5-ға) (hd arbitrary), 
For the case s — 3 with 
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кейі We — for g which will make p(J) = n— 3; it is, howe : 
general solution for which p(J) = n —2 Ph з en opa m the rank 
is an integral of риф 
where u = (x—0,)/ du + (9100) + uda) + и%фу(ш)) dw = 0, 
(= 0,)/03, w = 04/04 and h, фу, Фз and фз are arbitrary functions. 


5. Problem (b). Из = 1, and g = h(x— 0), we require 


This will be satisfied if F i 
is will be satisfied if F is any (differentiable) function satisfying 
F(a, На, Tata, -Cn +a) = F(Xjy 9g sies Bq) +E (3) 


for all allow able x and а > note that this 15 ition 
llowable x 1а. W isi 
о 1 his is t! i iti isti 
is is the standard condition for the statistic F to be a ‘measure 


of location’. 
If s 6 
fs = 2 and g = A((z—0,)//04. we require 
п OF, п 21-Е, ӘР; 


1— № = 0, 

MA or, а ЗЕ; 85 0, 
п r A 
ізі Ox; 


hese . 
These ec stad EP stis des 2 
juations are satisfied if P, satisfies condition (3) and also Fy (Ary, Ка, +++ kxn) = KF (у, Va уда) 


while F. satisfi 
» satisfies 
Р(х H4, Ug n +a) = F(p Vo 59%) 
Eva) = Ех toss Un) 


(Кау, Кто, + 
ese are the standard conditions for JF, 


all allowable x, а and k. We note that th to be a ‘measure of 


dispersion’. 
the usual cases; there are, however, 


the above forms; for example, the y 
e sample. In this case 


ver a large proportion of 
h do not fall into any of 


б " А 
The above particular solutions co 
neter is estimated from the mean of th 


nu ^ б 

(ору din important distributions whic 
X?) distri i hero 1 є 

P) =n. ribution where the index parat 

REFERENCES 


of goodness of fit when the null hypothesis is composite. 


в D. E. (1956). Neyman’s ys test 
бы ан Aktuar. T'idskr. 39, 216-45. | 
d Herman & LEmwANN, E. L. (1954). The use of maxim 
lu n goodness of fit. Ann. Math. 
a г. №. & Jonson, N. L. (1948). 
Бенде estimated from the sample. Biometrika, 35, 182-90. 
АБ, o, Н. T. Н. (1928). Differential Equations. London: G. Bell and Sons. 
SON, (+. S. (1958). On Chi-square goodness-of-fit tests for continuous dis 


Soc. B, 20, 44-61. 


um likelihood estimates in № 


Statist. 25, 579-86. 
The probability integral transformation when parameters 


tributions. J. R. Statist. 


Extrema of quadratic forms with applications to statistics 
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duction and summary. In many statistical problems, one is frequé 
atie form or io of quadratic form: 
r of such pro 


*rminati 

1n 

Some eee of the extremum of a quadr i ; 

Mia р on the space. In this paper we consider а num е hee nh 

ш » (ii) a у су discrimination, (iV e т 
u ) a moment problem, (iii) discrimin (iv) eat itd. шы өт, E 


tio. 
ns в 2. 
mu may be obtained аз а consequence of two inequ 
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ples. In what follows the notation E(x) and V(x) will 
Some exam : 
respectively. 


(1) Stratified sampling. Lot X4,,X.. 
finite population consisting of Lk strata, 1 
ф= 1 k. А simple random sample а, "Tin (i= 1,. 
b... 

is given by 


iance 
denote the mean and variance, 


Г. = be а 
Хы» М. +... TN. = Nis : a 
91 and intra-stratum үа к ү 
i 'stimate 
-+ k) is chosen and ап unbiased estim 
V. = УМ, М, 
where z; is the sample mean of the ith Stratum. The variance of 3 


T, is given by 


E №52 k №53 
V mac ba UND ы. 
Уа) E Мө à № 
The problem of op 


timum allocation is to determi 
let 


; he 
T fs ct to t 
NE 75,...,n, во ав to minimize (т) subje 
restriction an, +... tarn, = с. If we 
т = (п, ee) т = N,8;/N, В. = 


diag. (a, ...а,), D 
then the Problem is to find 


> х 

n = diag, (п,,....т,), e= (1,...,1) 

Minzp-ij 
n 


such that €D, Dye’ = с. 
5.4 inimum 
(ii) A moment problem. Karush & Wolfsohn (1955) consider the problem of determining the min 
distance from the origin to а point w = (w, эзш) in k-dimensional 
restrictions 


Ў the 
Euclidean space subject to 


1:1xm4] 
1" оо ke 
Baf п а n 
1" от km 
then the problem becomes that of finding 
Min wy Such that wB’ =e, 
w 
(iii) Discrimination, Let 
and соуагі 


c= Bissaw) f 
i tri ( 1 4 >) follow а 
then E(q) [Л 


(iv) Canonica 


l Correlation, 
mean vector Zero an, 


72 follow а (p+ 


" ith 
%)-variate normal distribution W! 
== [2 = 

where Y; 


a= 
and chooge the Weight, functi 
e note that Ed,q, = қы. 


v to m 
2 = “Ур, V(dj) 


s dy 
= WE. w, Correlation Coefficient between d, and 
,80 that the Problem is to find 


"à 
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T'wo inequalities. A perusal of the problems indicate that they are of two types: 
(I) Find MinwAw’ such that wB = a, where w: 1 Xp, А: px p is positive definite, B: p x k, p > k 
w 


is of rank k. 
Е wdw А = " 
(II) Find Max ОБО?" where А: p x p is positive semi-definite, B: p x р is positive definite. 
w 
The usual solutions may involve Lagrange multipliers, direct differentiation, or some geometrical 


argument. The following inequalities permit solutions to (I) and (II). 
Inequality 1. If М is positive definite, then 


(xy)? <(аМ2”%(уМ-1у)), (1) 


with equality holding if and only if zM = ay, where g is a scalar. 
Proof. Since M is positive definite, there exists a factorization M = LL’, where L is non-singular. 
Using the Cauchy inequality (uv)? < (ми’) (007) and the correspondence и = xL, v = L-wy', we obtain 


(1). Equality holds if and only if u = av, i.c. if M = ау. 
Alternatively we can now show that (1) is a special case of the generalized Cauchy inequality which 


States that in a linear space with inner product we have 


KEDI Ц. Їй, 


where || f || = (f,/)! is the norm. 
The space of all n-dimensional vectors is linear and the inner product may be chosen as ху’. Then if L 


isany nxn non-singular matrix and x and у vectors, we get 
uzzL, v=y(L-), (u,v) =a2y’, 
jul = 2210 = Ма, 
Іші = (L Loy! = yy’, 


to obtain (xy)? < (“Мә(уМ-1у”). 
By making the correspondence w = x, A = M, aB’ = y, we obtain the bound in I, 


Р (оВа9% _ ЕСЕ (1) 
(wdw) > дара ^ aB'ABa" 


With equality holding if and only if wA = aaB’. Because of the constraint we also have 


uad (aa^) 
a=wB=aaB’AB, а = сдава’ 


80 that equality holds if and only if "m ) 
РИШЕ. ИНЕт гл 
aB'A-1Ba 


I nequality 2. If A is positive semi-definite and В is positive definite, then 
= шАш” (2) 


i = max (0, ..., 0). 
Where 6,, ..., 0, are the roots of |4 —OB| = 0,0, = паа max (0; M 
Proof. There exists a non-singular transformation w = u& suc 
ш>шАш = uQAQ'u' = ирди’, 
wBw = uQBQ'w' = uw, 


.,0, are the roots of |A —0B| = 0. Hence 


Where Dg = diag. (0,, ..., 0p) апа 0, -- 


'a —0B| = 0, namely 
^ is one non-zero root of |а’а— 0. ў 
ао а le the two inequalities yield the same result 


Ms note that if А is of rank 1, mi ow. Thus in this case 


“В, a fact which is apparent from (3) bel 
(wa^)? е аВ-а'= бы. 
wBw' 
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his point we can solve (i)-(iv). Proble 
рд à 
Ari 2 Ры solved using (2). тəн "m 
or (2); (iv z1r) (eDED, Die’) > (гі H ended 

гаас ы of the restriction, the с 
eDiD, = ar. 


d » solved using (1) 
(i) (ii) es lv i (1); (iii) may be solv ad u 

1), are solved using ay e E 
lower bound, with e 


i if 
quality holding if and only 
nt 


rDie ue: rDic'' 
> орг 
(sn) = ер, = arD тре”, 
and hence "T 
i.e. 


EN,8S, Ja," ss 
5 асе by : 

) The solution is given directly by (1^) with A Ig = lxm41,and Z rep 
(i es | 

(m4 1)? 

eBB'e * 

m+) 

with equality holding if and only if w= «ВЕР (еВ). 


шш” > 


(іі) Either (1) or (2) leads to the bound 0x-wy. 
(iv) Using ( 1) or (2) to obtain the first inequ 


" LXI. а have 
ality and (2) to obtain the Second inequality, wc 


/ cipe ut 
(«Хуа é Puke Eu < бу, 
(их. и) (05,0) чуи 

i = in 

where бу is the maximum root of Уыз У{, -0X4|- v i sva dbi 
e Es that the expression q4 15 occurs as the Solution of Some of these problems, anc 
an alternative expression [ay = ма +y Mx"), " 
еу is of rar 
where M {5 non-singular and т, yand lxn TOW vectors This follows from the fact that x y iso (3) 
iios + Misty] mg ttr (M-as) = | +yM- yz’, 
A --a'a 

Таша =¥ = a, we have аА _ | E РЯ 


REFERENCE 
Kanusg, W.& Worrsogy. Қ; 
Proc, Amer, 23-32. 


On Certain Properties of Power-serieg distributions + 


Bv ©. с. KHATRI 


М. 5. University, of Baroda 


Ivrropveroy 
l. Noack (1950) hag defined a Power 


-Series distribution as 
PE = =) = z 


(1) 
x: а, Z* is convergent for |Z| < v. Her 
nd factoria] Cumulants, Which are utilized to Bio nis 
rmine, uniquely from its first two mon 


: КЕТЕРІ. 
+ an illustration 9f multinomial distribu 
ributiong. 


^ 


— > 
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2. Relation between factorial-cumulants and cumulants. The rth factorial-moment is defined by 


ЕВЕ) (§— 2h)... (Е rh +h), i.e. the coefficient of 07/r! in the expansion of E(1--0A)^. Hence, the 
factorial-moment generating function is 


A F(0) = Е(1 + Ол)". (2) 
Note that as л > 0, F(0) + Ф(0) = Е(е%), (8) 
the moment-generating function. The relations between the moment-generating function and factorial- 
moment generating function are 
1 
FG) = алоо +) (4) 
and Q(0) = F((c/^ — 1)/A3. (8) 


Hence the relation between the factorial-cumulants and the cumulants сап be established. If кү, is 
the rth factorial cumulant and к, is the rth cumulant, then 
Kp >K, df A0. (6) 


3. Properties of power-series distributions. 
(3:1). If «jj is the jth factorial cumulant of the power-series distribution (1), then 


1Х4 
кы = zo (z^ ЗЕ 


аку. 
кину = 2 E -Ү) Құ» 


ан а а im 
where (2-35) = (r3) (ra) ... times. 


Proof. It can be verified by induction that 


and 


pr д г 
(1--0h)r Z-r^ Ф201 Oh)" = zz) d(Z(1 4-61)! 
20" 22 


and so, with / (м) defined by (1), 
а 2 | apad йен. С i ук 
gj ZAZA дауа) = (14- 0h)-* Zr^ (2 а) log f{Z(1 + ӨЛ). 
o seo that the factorial-moment generating function F(0) is f{Z(1 + 0h)™"} /f(Z) and so 


(7) 


From (1), it is easy t 


а (8) 
= {—_logf[Z(1+0h wj 
| gg SIA ) w 
Henco from (7), it follows immediately that 
d 9 
Kin = a (27) logf(Z). (9) 
Differentiating (9) with respect to Z, we shall have the recurrence relation stated above. 
Coronary 1. If A = 0, we have the relations for cumulants as 
ey = (10) 
к = (6%) logf(Z) = 2-%- 
Corortary 2. If = 1, we have the relations as 
(11) 


d' _ „Кии (Пк. 
Ki = 2198/0) -2-а (7 " 
termined from its first two cumulants (or moments). 
ribution be given by f(Z) and the first two cumu- 


distribution is uniquely de 
nknown. Then from the cumulant relation (10), 


ries which determines a dist 
where Z = g(p) is u 
NES = fat) 
Я Ж. , where y(p) =/(9(P)) 

we have к. = Z-Z од = gp ENP) E 
ак _ аа [E 
= ЛЕТ = dl dp 


(3-2). Тһе power-series 
Proof. Let the power-se 
lants be given in the parameter p 
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(12) 
Tosg лы Ka = 3. log gp) кі. 
Therefore dp dp dp 
dk, / ) : 
= Jap] 2) des = logg (p)+loge,, вау, 
Hence logZ ТЕ 2) dpt e; 1 И" 
i Z= Gp) 
" gs 1 loge, say 
d logy(p) = ur кз) dp c, = log y, p) -+log ей , Р 
an 
(А 
ie. (0) = ey (p), 
where су, Ca, c, and €, are constants, 
Let a, be the coefficient of (gp) 
in the expansion of 


icient of Z* 
in the expansion of з. (р), then саста, will be the ieri e 
probability that the random variable $ takes the value г 
C201 "a, ур) = ash рук). 
Hence, wi У, We can assume in equations (13 


Ик. 
СЕ 
апа 


Ур). Hence, the 


(15) 
) and (14) that 5 mw mL 


(16) 
; ived in 

ve Кү and к, equa], then by the expressions derive 

се we have proved the theorem, 
-series distribution other 

4. Illustrative examples, 

(а) Negative binomial distribution, 

Let 2) = (1—Z)-^ for 0 <%<1 


ts 
than tho first two cumulan 


э2>0,2- 


P/(1+p) 
Р( = т) = Газа) 


= DÍq, p >0. Тһеп 
We have from cumulant ге 


P*q-"-2 for - 0,1,9 "n 
2!Г(и) 2 9 2=0,1,2,.... 
lation 
dk, , dk, 
GSA  — 24 ар” р 
“ПЧ aaa ee Ж е м АҒАР" + Ма Е са are given: 
we have ho. s) Ф Ka = npq(q-- р) an К = 724 (1 + бра) Conversely, if K, and к, 
^em (|2 =? 
Pq q 
and 
п Л2)- ехр (о) = 4" = (1— Z)-», 
Hence wy have the negative binomial distribution, 
d ae € ae binomial distribution truncated at 2 = 0. 
zi ть 2 = Pla, p > 0. Then 
n à ES 
hip - —— for 2= 1,9 "m 
IT) oa ИИА 
In this case also, we have Kies эаак, slp d q'—I 
- » во 
Кі = ара (аа 1), к 
ovest, from тапак, we m db Ke ^ пра” ( д» —@—р)/(а% — 1), ete, 
У Partioular case op (jv а “мы, н; Dis £1 ЕРЕТІН 
à Shall haye 
Kim Pllog = 19) 
а cumulants, wo ылу 7 and x, 2091089 р) пов а)° i 
teme 
and = 
| о 
984 
Hence the distribution is th logar S (2) = — 8 ( ). 
ве, 
Pig ша)ш — 
) pq “Itlog g for т-19 (20) 
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| Positive binomial distribution. 
wt f(Z) = (1+Z)", Z > 0, n is a positive integer, Z = p(1—p), 0 €x p < 1. Then 


P(£2z)-2 и = п-г 
(Е = x) () p*(l—p) for £0.12, (21) 
The cumulant relation is к= 0 et -pl- e 
dZ ы ар” 
апа зо ку= пр апа кұ-ар(1-р). 


The converse problem is similar to (a) and (6). 
(d) Poisson distribution. This can be treated in the same manner. 
5. Multivariate extension with multinomial distribution «s an illustration 


Let IG Aa) Ж ты. ә INS В 


Fs con ы BO: 


| i 
NS we shall define the probability of the random variables £,....£; taking for values r ғ 
espeetively, as ice ма, 
1 ely, а PE, mae. рша, = Oy, ag ZT ИИ... Фу)» (23) 
(А) Example. Let f(Z,. Zi) = (V ZA b ES where n is a positive integer, 
+1 
Z,-ppee Op sk. Ур=1. 
ізі 
т n! 
Then P(E = ть. Se = te) = = pit... pe psi 
auberge! 
* kl 
Where У aan (20,13...) (23) 
і-1 


This is a positive multinomial distribution. 
ial cumulant of the distribution given in (22). Then it is еазу to show that 


Let Қ, сай be the factori 

k eM 
Kisses rit П ежен 2) logf(Z. .... Z4). (24) 
ізі Chi, 
Кин os Hf ogo 
— 8, Күн, гы 


шы Кү, 
reach the corresponding relations for eumulants. 


If A, = 0 for all іп (24), we can at once 


Example. Let қы 
Hs m (+r +20)". where Zi = piper OSPS. à Bye 1. 


We shall here only write down the means, variances and covariances: other eumulants can be written 
down from the recurrence relations. 


k қ " 
E(£)- Б > Л жару for Ф=1,2, 
: A 


ВЕ) = пра) for i=l 2k 


a ё = +7 = 1,2,..., 25) 
н 2m omips«zclH)--—u8p fo і%4)-1,2,... Ж. (25 
ала cov (Е.Е) = фа е - 2%, (5 1р} 
TA istribution from means, variances and covariances 
das verse pr blemofobtaining the distribu à 
Now consider theconverse pre of the power-series type. 


) if the distribution is 
e ...К. Then 


k 


ons. 5 
Riven in parameters (yi: Ys 


; eru: 
Let Z, be the function of (fi Ye» а) b= b2- 


ё r gu Sue 
vit E) = Lace ШЕ = ZA 
cov (£o £; УЛ 21 6 
а k д, 2 
ё ey =Z, У LEG 
=Z, 27, (ё) ад БЕДЕ? j 
fox} 
+ {= ә d 1 Е а 
j = 1,8, 0k anc аи ? pg) (= 1,2, s dj: (26) 


k 
JE) — 5 
1 (£2 ~ >Х eZ; ст 

Biom. 46 


31 
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fe ons Wi 1 oluti 2, ду, (220), 
t o i (26), we can obtain the s lution for (CY, 
uati > 
m the ве! q 
Fro 


ancy ie) 
as a function giae 

we have 

k). Then we 

APE 

G,t=1, 


dZ, dZ, > (27) 
UNES D Yd +... 1, — =l РЕ 7 
а: Ii, x for i , 
ЖЕЛ yx) + DEOR yk Z 
Vi = gii тақ 2 


i ta. 
tions, we can ob 
Ъоуе set of equat ‘ 
seria Bie hence obtain Zi. T ie 
шь y Mi а Solution from 24822.) og, 
-— v the distribution. 


сел 
E etions о 
а і s of func p 
in dZ,/Z 2 OZ, еи) in terms "os A 
Е de da y^ i arms of 21. 25 "ies; 
42, d can express E(£,) in tern diventa тық 

À B meee (6). Thus we can determine 

Zi es 6 = ЕЕ). & 


25). Тһеп 


= nz, £P! by the use of (26), 
cov (2, 2, 
for і+ј = бый. 
і.е. 
for j= ВИ Ж 
P 2 28) 
Similarly dz, ажы, көкет 42. ( 
ар, ағ, VOL a Е = 29 2, 
Непсе —=-р, = - - ( m 
* i Б. Ға 5s К), we have to obtain the inverse of 
To obtain 42,2, т terms о Di e 
=I-Ip’, 
“Ру =p, 1-р, 
Where р’ = (n bg 5р), V = (1,1, 


А that 
matrix, We can easily show 


k 
Dea = 1- b» Dj. 


3-1 
= 1 22, 
1 ie is JL ана 
2,ёр, а, 
E 


= Т+1р'р, 1, 
Thus the Solution ig 


= т d ше 1,2% k. 
l/p for 4 Tm 1,9,... 
2, т , It kel 
p= logpi[p, i.e, = Рика. From 


k 8 
EE) Zim b» 4) лл. 
22, 


: (29) 
We obtain ДР, ... Z) = nlog (i+ a en ты 
ј=1 u р 
i - Since (29) is true for all i, wo mu: 


k n 
JV) (1 + bI 2 ; ix 
qe p das cen 
yy Te 
"5755. to take the values x, 
ultinomial distribution, 


ee 2-..-2,) > Where " 
k 
inf (o da), 052,51, %<1 and рь о 
= м] 
"Then E 
n at) А (E ) (30) 
жылыт. 3-1 Я ч Б Е: =0.1,2..... 
GET эф, E79] hy! Tn) pi niea +») del for ally 0 

We Shall have means, ariances and covariances as 

(&)— xp, y 6) = np] +P) and SOM ЕУ = "DO for 4 Fj =1,8,...,®. 
The Converge Tesult can also be obtained in the Same m, 


anner 


as the above p 
REFERENC 
1950), A class of random variables 


e. 
" ial on 
Ositive multinomia 


-32. 
i 2 197 
With discrete distributions, Ann. Math. Statist. 21, 


— f. 
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REVIEWS 


The Advanced Theory of Statistics. Volume I, Distribution Theory. By MAURICE 


С. KENDALL and Аглх Stuart. London: Charles Griffin and Co. Ltd. 1958. Pp. 

хи + 433. 845. 
Тһе 16 years which have elapsed since the publication in 1943 of vol. 1 of M. G. Kendall’s The Advanced 
T'heory of Statistics have witnessed a remarkable, indeed a rather bewildering output of text-books on 
Statistics of very varied quality. It is therefore most satisfaetory that Professor Kendall. with Mr Alan 
Stuart’s help, has already completed part of the process of bringing his pioneer contribution up to 
date. To maintain the same comprehensive standard, three volumes will be required in place of two: 
the first was published in the summer of 1958 and the second is promised in 1960. 

Broadly speaking, tho first 13 chapters of the new vol. т correspond to the first 11 chapters of the 
original vol. т. The increase in number results from dividing the old chapter 7 on Probability and Like- 
lihood into chapters on The Calculus of Probabilities and Probability and Statistical Inference, and 
the splitting of chapter 11 on Approximations to Sampling Distributions into two. 

The expansion of the work into three volumes has no doubt made it possible for the authors to think 
out again the order and form of presentation of the later material; as a result the present volume closes 
with three in place of the original five chapters; of these, chapter 14 on Order-statistics and 15 оп 
Тһе Multivariate Normal Distribution and Quadratie Forms are new, while the final chapter 16 on 
Distributions Associated with the Normal contains material drawn together from elsewhere in the 
Original vols. r and и. | 

The changes made in turning the old chapters into the new are of two kinds. In the first place there 
аге additions of entirely new material made necessary by advances in statistical theory and improve- 
ments in its techniques. Thus we find, for example, accounts of R. A. Fisher's logarithmie distribution 
with an example (885:10, 5:17); of the log-normal distribution and of N. L. Johnson s Su and Sz 
frequeney curves (556-27-6-35); а reference to uses of the method of steepest descents ($ 11-18), and 
а brief account of the method of deriving sampling moments systematically with the help of tables 


ОҒ symmetric functions, such as those recently compiled by F. N. David and M. G. Kendall ($# 12-5). 
; erial, it is evident that the authors have reconsidered the 


But apart from tho introduction of new mat | 
detailed presentation throughout and where they thought that this would clarify the argument have 


made alterations by slight re-arrangement of the text, by addition of paragraphs and by кернеуде 
of tho illustrative examples. This process of improvement has been helped by € 2 хонар ы 
—the main section headings are now in bold instead of italic type, the tables are heal od in А Жа 
and printed in a more pleasing manner and the algebraie formulae аге now ӨЫ еа а 2 27 е n 
Тһе first doubling of chapters mentioned above has permitted an improve p д E dro 
brief introduction to the ideas of probability, likelihood and inference which was inserted 1n g 


i i ү i i Пу a mathe- 
i “Ww i oint to the development in vol. 1 of what is primari r 
edad caa E кте Thus the concept of а random variable is now treated in 


mati ry of statistical distributions. у | и 
кедір е ШЫҒЫ 112-719) It is interesting to note the authors’ conclusion, шоо a - 
V КА JJ. à E ; 

following sentences (p. 180), that there is no unique best method to relate a mathematica! y 
the way wo think: | | | амалы 
for himself and that his psychological make-up, 
1 ever, that every man must choose у Ш з ра log Е 
his iis rete ОШ his fields = е all determine the kind of axiomatization d — pls 
In " ti ties it is a mark of immaturity to argue overmuch about the fundamentals or p y 
statistic 


aa asis is presumably on *overmuch' and the authors are 


i 5 hi + . 

In the last sentence the intended emphasi | | di кча ы 

inki сі tic assertion which claims that опе m Lof appro 

bier. — y еі eine mean that there should not be continued interest in the fundamentals 
ers wrong. 


i с / > is bound to be 
Dor fail to realise that when two or three people discuss such matters keenly there is 


argument ! А 3 ke sible some re-arrangement and expan- 
y d 11 into 12 апа 13, makes pos: > 3 + 254 
i : s poor ds e: iiber sumando and sampling cumulants, including the ше з P 
no E c us ауен Жүйе а, the second of the two chapters deals with bivariate k-statistics an 
ulations. As nov , x А ter. 
tho proofs of the combinatorial een Lepus үлды pies sections on the distribution of mth 
The chapter on Order Statistics (UD bendes aen ie rs, contains a good 


S and о v4 viously appeared іп зе chapter оп Stai ат Diuve) 
alue 1 lin tl 1 ti Standard I 
d of range v hieh previous Т 
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ivariate № M ЭЛЕК и 
deal of new matter. ctor and matrix notation whieh prema will be taken up again in vol. 1 
Forms introduces the aa of multivariate analysis is to be included. 
where the oe ee qe bes with а mixed series of chapters in which, 
Тһе original ке straightforward numerical illustrations were 
Fargo ecc. Ja correlation and regression, and (c) of othe 
eu. of applieation has now been remove 


i ratie 
ormal Distribution and Quadra 


besides the development н the 
given of the uses (a) of t tie. 
r measures of association. All this i 


; ; сире nes, and 

d, presumably to be included in the latc r "qnd ne 

1 Y M 1 M з Associntec 

эг i single ehapter on Distributions Assoc 

j 7 here brought together in a sing! 
instead we have 


jd 
m: m i i istri ions of the standarc 
i i c tions to the distributions of 47 
athematical properties of and approxima: 8 un * ООВ 
— "^ н тар s a Х, 1. Вапа = r and b. Тһеге аге obvious advantages in this pini Ж 
eo ie ке к; istributi з Ін required for later ay á 

i i зе the dis tion theory of the tools requ Е pere 

е kes available in one place the distribu зогу о гоб, 1 АТ FS 

e ccu closes with 12 single-page tables, 10 dealing with the normal distribution, is d and Hii 
and two giving (i) augmented symmetric functions in terms of power-sums up to weig 


È The last two tables 
multiple k-statistics in ter ented symmetric functions up to 6th order. The e = quötod 
are linked with the discussion in chapter 12. There is also a list of References to the authors 


А x iiis зе name 
and а 14 pp. Index, It is a pity that here (as well as in another recent, book on statistics) tl 
Gosset has been Spelt with two ts! 


i я үг во that there 
The Exercises left for solution by the reader have been greatly increased in number so t 

are now about 20 at the end of each chapter, This should 

student. 


rae 04 serious 
prove a valuable addition for the s 
It is of course impossible 


at present for a ге 
because its Successful rounding off must depend on the way 
Inference and, Statistical Relationship (2) and Statistical Plar 
are built up on the present volume on Distribution 
inevitably alternative 


* х es whole, 
viewer to survey this great undertaking as a 


а А 0 гаса! 

in which the later volumes on one (3) 
{, , 1 Serie: 

ining and Analysis, and Time-Seri 


ат " ure 
Theory. With a project of this magnitude данам 

methods of arranging the order of the parts. In dealing with statisties and | 
bability the writer of 


д ion. 
ап expository text is faced with the problem of ordering та single mo 
Subjects whose links are essentially multi-dimensional, From the teaching point of view it is instruct 
in itself to see how the present authors have attacked their problem, 
The decision on the broad plan wag taken by Professor Kendall nearly The 
already appearing in the revised vo], Mis plan is being modified to some extent. iso 
removal of much of the numerical illustration from the later chapters, while retaining a generous uh 
of such illustration in the firs TS appears a sound Policy. A more difficult matter for decis! is 
must have been whe extent to introduce the ideas of inference and random s 
pling. The placing of an account of Bayes’s Theorem and Maximum Likelihood with a discussion 9 
randomness and bias in Practical в; ing i 
lous. 


f қ ч z г эта” 
: г А racti middle of vo], т is undoubtedly ІП some ways anc he 
So, too, is the inclusion in this volume of t, of x2, t, P and z before t : 
reader has been Shown h iba 


ables of Percentage points vil 
it is remembered that ultimately all three p agar 
wn linking and cross-linking О 


shanges 
20 years ago, but the chang 


hoose at will his o 


ark that he awaits the appearance of vol: 
ishes there were a solution to th 


; ant 
8. п and ти in please 
ing his own е 8 of the complet, 


; the 
e problem of how Lom 
ора е series at a total p 


E. S. PEARSON 


: Cox. New York: J 


ohn Wiley and Sons Ine; 
1958, Pp. vii +308. 60s, 


E .. Ва person experienced in handling ж 
айоп (decision-making is a further step which usua {те 
istic me fluency in various ied 
xtending these techniques. v ah 
ized problems arising out of ns 
^al data. All three classificatio! 

ian of most į ' value to а commerei 


sufficient mention of the 


mit 
and standard texts that n еһ- 
encountered ; i M ical tee” 
nee in these ed in ying the mathemati 


lif’ 
"w graduates very seldom qué 


"c c—Rn——————" 
— — Ж ÉÓÓ——— Áo 
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to: bieesllodivenLatutisticinns i 

n ae and it usually takes them some time to supplement their formal training 
| Dr Cox's book is extremely welcome in that it concentrates on just those matters where little guida 

is available elsewhere. The practical aspects of experimental design are here dealt with ee hi МУ 
individual way —the discussion avoids mathematical кеенен Нн, yet is careful and азаа; 6 i 
mostly in terms of concrete examples, taken from many different fields. It is intended primarily for the 
private reading and reference of the experimental workers, and requires no specialized knowledge for 
its understanding: thus for instance the details of the analyses of variance involved receive only incidental 


mention. 
Of course, there is no substitute for experience; and Dr Cox is careful to point out that his examples 


have often been si iti : inci i i i 
` an simplified so that the principle ar discuss ^ rly 
ехе See * п! а 1 e principle under discussion might more clearly be brought out; 
nees to the original accounts have been given wherever possible. Nevertheless, the book goes 
+ "mid T is i i 1 " 1 1 1 p 5. : 
much farther than is usual in either elementary or advanced texts in bringing out the difliculties, and in 


expounding general principles. Its relation to such works as that of Cochran and Cox is not so much 


introductory as complementary. 
The first nine chapters deal with basic concepts and the key designs; the last five touch briefly on some 


more advanced topics. The concern is solely with comparative experiments so that, for example, it does 
not cover the planning of surveys. 


In summary, this is a book about r 
it should be made required reading for all students of statistics. 
C. L. MALLOWS 


cal statistics. It is casy to predict that it will have considerable 


success; 


Theory and Methods of Scaling. By Warren S. Товаввзох. New York: John Wiley 
‚ London: Chapman and Hall, Ltd. 1958. Pp. xiv +460. 765. 


Social Sciences Research Council appointed a committee to study the relative 
erson was chosen to review and summarize, under 
begins with a theoretical examination 
and then proceeds to classify the various techniques for 
fter discussing the alternative criteria 
scheme based on ‘the different ways 


develops a working $ 
es may be allocated ’. This leads him to distinguish three 


and Sons Ine. 


In 1950 the American 
merits of different methods of scaling; and Dr Torg 


the general guidance of the committee, the material available. He 


of the different types of scientific measurement, 
laborated during recent years. 2 


metrical scaling which have been e 
that could be used for such a classification, he 
in which the variations of the subjects! respons 
main groups. 


. | E m И 
The first group of procedures consists of those which attribute the variations m the subjects’ responses 


to individual differences in the subjects themselves: this he terms the ‘subject-centred approach’. 
It is the principle underlying most of the work on mental testing. But, largely because it has pomm so 
fully discussed elsewhere, Dr Torgerson has decided to exclude it from his own review. The р ей t 
adopts what he calls the ‘stimulus-centred approach '. Неге the systematic variations in the su e 
reactions are attributed to the way the stimuli vary in respect of some specific attribute. Lai ee y 
ofthe well-known psychophysical methods fall into this group: and these with their = ^ i т pe 
are diseussed in considerable detail. Separate chapters are devoted to methods ыы m E he c = 
comparative judgement’, the ‘law of DE Uere ^ dale 252244 multidimensiona 
scaling—a topic to which Dr Torgerson has himself made important contri! b : | | 
The third UR adopts the 'response method'. Here the быт aa ros parisiens 
ascribed partly to variations in the subjects and partly to variations in m - sie ice —— 
ошап n kp Late сым "E d = т 2 = two main subclasses— 
the р s devised are re atively few and compare 9 ic ue eines n 
Sema ТЕ of which Guttman’s is the best known караа buone models’, 
ЖЫН иийй тна а ue aca Mecum нао а heads are lucidly explained with 
А a a important procec ur н ap pue am Ў i se кедей 
о кр the Pnderlyiug theories on which they не m or ymo —€— À с че d 
апа limitations impartially discussed. Much of the material inc ST ео сом? е. 
in isolated art icles or specialist text-books. Hence the volanie eal ет аша. À g 
who seeks to apply quantitative techniques to social or psy¢ g CYRIL BURT 
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Linear Programming: Methods and A 


ны Yew York 
Pplications. Ву Saun І. Gass. New Yor 
and London: McGraw-Hill Book 


Co. Ine. 1958, Pp. 223. 50s. 6d. 
ѕ on Linear Programming 
It answers in this respect a 
aims). The text had been pr 


К ы sive formal 
in that it contains an extensive “oe "s 
MES. 2 з бев 
real need (even if it is not the md is 
epared for an academic се А-а 
+ " iets ` then 
d readership— with well-defined and predictable —— viia 
i 2 ` g 
evel of Sophistication is perhaps best defined by men uns d hi 
z —— 
hat about 20 Pages are devoted to the formal theor VID 

convex sets and simultan i 

’ readable р 


' ional. 
art 11—Methods: Theoretical апа Computatic 
Subjects related to the Simplex Method 
Parametric Linear i 


Partly to the author himself. 
ег uncertainty, Тһеге is, also, 
:essarily and admittedly alre: 
not the only feature which makes the book a 
rete Linear Programming are mentioned or 
own when the book Was conceived, But еу 
n the reviewer's 9pinion—met} 
as followed here close i 


idt includes a бире =>. 
) which deals with а minor n 
а survey of available automa 

ady incomplete, А 
Ppear somewhat dated. ine шал 
nly casually, clearly because soie : әнін 
en regarding the foundations thert | ^ hÝ 
actie valne than those presentet E 


7 en р ' саз 
8, whose Importance is, in many 
50, the purpose of making the matter clearly 
praise which some other 


programmes for digi 

However, this is 
non-linear, and dise: 
methods were ип 
now—at any rate j 
the author, Heh 


understood is achieved, and thi 
books on the same subject have failed to deserve, eather 
When we come to part I11—Applieations, we find the author less successful, He gives here a га 
Scrappy list of descriptions of Linear Programming in pract ice, bu 
not know the e ў 


8 
кедір: к 3 ar who does 
titis Impossible for a reader wh 

© guess how some of them could | 


Б form. 
зе put into Linear des sei ы ы 
ce, to the Paper Trim Problem, on р. 182, and to the Travelling 5 
* OD p. 186.) Tt is true that the book i 


ы р ed by 
К contains а bibliography of applications, eens of 
» Industria] military etc., but too many of the titles refer to Se 27 
"9. Government agencies, even where their text has since appeared in readily available periodi 
e.g. Management Science ог Operations Research nd 
e last Chapter contains а clearly апа Well-written description of two-person м 
establishes their Connexion with Linear Programming les 
3 ч е 
оп Гек Ваз беп done very carefully опу very few misprints Survived (e.g. coeds for oe 
p. : ie vy Ohm eration 9n p. 6 should be one of alternatives, not of simultaneous "c 
Tir i and ee RE mul, bove, the book is а very useful contribution to the eren! 
ге, item on (| shelf of any 0 is interested in 
foundations "^ 1 Y Operationa] Research worker who is interes 
Well as the applications of his subject, 5. VAJDA 
Wahrscheinlichk i а 
eitsr ili > 
tee echnung ung Mathematische Statistik (Probability pa 
Verlag а ~-atistics). Second edition. By M, Fisz. Berlin: VEB Deutsc 
ag der Wi, Senschaften (Hochschulbiiche, 
Pp. 528. Ту Е 
The first eq 


58. 
für Mathematik Band 40). 195 


H i wed 
1 statystyka matematyczna) was "iit 
+ Birnbaum, This second edition, much more exten: 
У in Polish and German, ith 
d with the theor ili шей 
1 ә У of probability, the ве А 
5 the theory p lating to inference from sample. values to the universe i 
а : rst five chapters the author discusses basic ideas: О 
Y s variates ang their distributions and чака ат 
6 Sixth cha ter is devoted to the more mien T 
many ofthe ma; ematical formulae help to HBS tor’ 
е minimum yari 2 © author uses sion ‘the most efficient a or ^ 
ontains definition ‘the efficient M 1 be 
bility and statistics and shoulc 
90k and üs z i 


th 
ехрепвіуе Ғор; 8 Siz e useful bo 
s text-book. Or its size and should b 


рт 
REGINA C. ELAN 
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On the Dynamics of Exploited Fish Populations. Ву В. J. Н. Beverron and $. J. 
Horr. (Fishery Investigations, series и, volume xix.) London: Н.М.8.0. 1957. 
Pp. 533. £6. 6s. 
It is a pleasure to bring this book to the notice of statisticians and biometricians. The authors, one from 
the Fisheries Laboratory at Lowestoft and one from the Fisheries Division of the Food and Agriculture 
Organization at Rome, set out to describe in mathematical language the processes of fishing and seek 
from this description to predict what the effect of changing methods might be. Part т is concerned with 
"Fundamentals of the theory of fishing, illustrated by the analysis ofa trawl fishery’. A preliminary model 
assuming a steady state in a fishery is described and the model is then distorted to allow for other charac- 
teristics. In part 11 extensions of the theory of fishing are discussed. We are introduced to the effect on 
the fish population of egg production, natural mortality, gear and fishing intensity, and growth and 


feeding. 

Now it is clear that the setting up of mathematical models will involve the estimation of the para- 
meters involved in these models. This estimation is thoroughly discussed in part ит, while part ту gives 
the effect on the model of possible variations in the parameters and attempts prediction. Part 1v in 
fact. tries to predict what would be the outcome of attempting to regulate fishing effort and net-mesh 
size. The authors come to the conclusion that the ‘best’ results—*best’ meaning increased catches, 
inerease in profits and saving in the overheads of men and labour—will be obtained if the fishing effort 
i strieted to half that of the years before 1939, or less, and if the trawl nets have an enforced 80-90 mm. 


mesh size. 
The statistician will be quite clear that to set up mathematical models is not enough: it is also necessary 


to verify that the model fits the observed data. The authors are also aware of this necessity. They do not 
in the course of their book usually give the data on which their models are based. What they do is to 
refer to the published research work of themselves and others as justification. In the light of the length 
of the book in even its present state this seems an admirable procedure. 

The exposition is lucid and enjoyable. It can be recommended to anyone tired of abstract theory and 


wanting real applications. F. М. DAVID 


Contribuito allo Studio delle Tavole di Nuzialità. By С. Panizzon. Padova, Italy: 


Cedam. 1958. Pp. 143. 1500 lire. 


This book is concerned with the construction of tables of the probability of marriage in Italy. The 
probability of marrying within a year, for a bachelor or spinster of any given age, is termed the ‘relative’ 
probability. Since an appreciable proportion of unmarried persons will die, emigrate or immigrate within 
the year, a correction has to be made for these factors before a valid comparison can be made between 
marriage rates at different epochs. The author therefore calculates a hypothetical ‘absolute gem, 
of marriage which would result if migration or death could have been prevented. The life tables = 
on these absolute probabilities are called ‘gross’ tables. The author shows how to obtain relative an 


absolute probabilities and net and gross tables from available demographic data, and gives CX eem 
of his own based on the 1951/52 census, comparing these with earlier Italian work, and also with Britis 
iage is now postponed, at least for 


marriage rates. The general conclusion is that on the average marri S db 
reciably i 0. The modal probability for males has shifted by 
bachelors, to an appreciably later date than in 190! " p SS ee ан 


i i i i ller. 
1 , 4 years in the last 50 years: in females the difference is rather smaller, | 
кызл ап у ated by numerous tables and a few graphs. There is also & 


appreciably less skew. The book is illustr: х x 
discussion of abbreviated tables (at, say, 5-уеаг intervals) and rapid methods. CEDRIO A. B. SMITH 


у. By C. I. Buiss. New Haven, 


ion in Biology and Climatolog 
es ei Station. (Bulletin 615.) 1958. 


Periodic Re£ : 
Connecticut Agricultural Experimental 


Connecticut: 
Pp. 55. Gratis. 
Тһе author discusses the analys 
dently, where the observations are sp 


at each point. Illustrations are given on monthly 


ote: ; death rates from pneumonia, 

tial in an e tree, monthly dea 1 s 1 

[е > i cmm E: human on of toads to gonadotrophin and 
y incide 


births, reactt сәкен 
гес опса . 
cows. It will be noted that the d free on apr 


en the length of the cycle is determined indepen- 
vals and there is а constant number of replications 
iodine values for butterfat, hourly 
monthly incidence of poliomyelitis, 
hourly heat exchange of 


F. N. DAVID 


is of periodie data wh: 


aced at even inter 
temperatures, 


bulletin is issue! 
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Inv P i i i ions search, No. 1.) 
i i ublications in Operations Researc 

entories and Maintenance. ( 

Queues, 


: Chapman 
By Рнпар M. Morse. New York: John Wiley and Sons Ine.: London: € hapn 
ani Hall Ltd. 1958. Pp. 202. 52s. 


Тһе authors of Finite Queueing Tables open their preface with the i 
t-books and monographs which discuss the : 4 ese his aubjeot. Кв 

bee lcomes Professor Morse's book as the first in English devoted entirely to this 

ет DIM "The present volume does not. pretend to be an exhaustive 

о | to present enough of the 

hniques...", and within t 


ы "ve äre several 
statement that ^ ther = ы his 
theoretical aspects of queuing theory `; neverthe 


treatise on queuing ose 
concepts, to define some of the а чеп 
his framework he is very successful. 4 1 jh 
ehaviour (which he dismisses too lightly: ‘Transient solutions 
rtance’) he confines his attention to the 
vice-time distributions are 


; ntial. If 
related to the negative exponentia 
king of too general stateme 


Bie 40): 
nts. Sometimes he is cautious vr b is 
boss da ishment times із 
probabilities are the same no matter what the distribution of replenishment 
(constant, or exponential, or hyper-exponential) "—sometimes he 


is right (р. 98): “Eqs. (7-37) and x 
robabilities Р, form a Becometrie вот As 
ibution may chance to be.’ But at least once he is wrong (p. . Ж Тп 
with any Single. infini леце system, the probability that the System is empty is 1 wen 
in D. б. Kendall’s now Standard notation) by E,/E,/] one can ob 
P,-0-pü-ja *P— AU py? 4- 4p], 

which is not generally equal to 1—p. 

Technically the book is well pr 
Presentation which could cause confus 
an E, arrival mechanism in the reverse 
fig. (8-5) does not corres) 
consistent. 

At the end of a discussion of a 
been reasoned out qualitatively w 
can ensure quantitatively that w 
vades the book, and although во; 


oduced and the number of 


ion to the unwary. On 


order: on р. 84 he nu 
pond to equation (8:6). The 


y f 
қ = А т” - ` pint о 
misprints is small, There is onc p s of 
` phases 
р. 47 the author numbers the phi 108 
М " » in 
mbers them in (һо natural order: on | : 


А = à ae ism 15 
notation used for ап E, service mechan 


| | ve 
maintenance model Professor Morse has written: * All this could we 
ithout recourse to the equations, By use of the equations, however jn 
© use the repair facilities in ап optimum manner.’ This pria T i 
me of his arguments aro a little unti i E i 

н е £ idy he has written a book е 
be ее за s tuo practical man who wants to describe real Situations, and useful also ps ja 
mathematician coming fresh to queuing theory who w, c it i rithout be! 
overwhelmed by iud гу ants to know what it is about wi 


D. M. G. WISHART 


n | Peck 
perations Research, Мо. 2.) By L. G. PEC 


n 
2 London: Chapma 
: o! Professor ) appears to have been written epo 
nd Hazelwood had not gue а 
wn: its would greatly have ecu ten 

tially the same audience) had a un 

| 1 f je the 

6 concerned Primarily with Problems of maintenance wher the 
8 finite, Two € | 


4 sof Ns 
s, Dang p " etions 0 
Parameter x (which ig ^m F, are tabulated as fun 


and essen 


т : 
shi p service): 
Е > a machine | roken down and waiting far вату 
ce being delayed (it i 


ility 
0 е case of Cd (it is really one minus the pui 
h ^ Population We encounter the difficulty Же 7 me Population these are the same, 
there are no more calls to be delayed) ASD. в. с dE Siting 
appear to be i Р үле Bis Ge 


derived D foy М = |, ie 
tables for М = Хх at „апа № = 3 


5 ч л then 
Қ Population is broken dow Dis 
rmulae 11.19 1 тз Pointed out to at least one reviewer, the taP as 
1 АЫ. i г Morse’s book this reviewer is 
Е: RA гоо"? 
ith D in Peck and Hazelwo' 


T 
Bait: fü WISHAR 


ions of 
steady-state solutions € 
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Handbook of Probability and Statistics with Tables. Ву В. S. BURINGTON and 
D. C. May. U.S.A. and London: McGraw-Hill Book Co. Ine. 1953. Рр.іх--246--72. 
46s. 64. 


This book containing 246 pages of expository material and 72 pages of tables is intended as a ‘cook-book’ 
for users of statisties rather than an instruction text for the embryo statistician. The idea has been to 
summarize formulae, definitions, theorems, ete., commonly met with in elementary statistics and 
probability theory, and to give short tables of the particular functions thus summarized. The topies 
covered are the binomial, Poisson and normal distributions, regression and time series, tests of signi- 
ficance and confidence intervals, elementary analysis of variance, finite differences and interpolation, 
and quality control. The tables include the binomial ,C,p7q"-* for n = 1(1)...20, x = 0(1)... п. 
р = 0:05 (0:05)... 0-50, the summed binomial for the same set of values, the Poisson distribution 
етуу) for т = 0-1 (0-1)... 10, 11 (1)... 20, and the summed Poisson, the normal curve area and or- 
dinates and the first five derivatives, tables of F, z, t, and x°. There are also supporting auxiliary tables. 


Che exposition is adequate without being inspired. 
F. N. DAVID 


Elementary Matrix Algebra. By Евах2 E. Нонх. New York: The Macmillan Co. 
1958. Pp. xi4-305. 52s. 6d. 

The author has had considerable experience in presenting the elements of matrix algebra to varied 
groups of students, including statisticians; the present volume appeared in preliminary editions in 
1952 and 1957. The material has been very carefully arranged and graded. The chapter headings are 
Introduction to Matrix Algebra, Determinants, The Inverse of a Matrix, Rank and Equivalence, Linear 
Equations and Linear Dependence, Vector Spaces and Linear ‘Transformations, Unitary and Orthogonal 
Transformations, The Characteristic Equation of a Matrix, and Bilinear, Quadratic, and Hermitian 
Forms. This final chapter includes a proof of Cochran's theorem. There are appendices on the X and II 
notations, complex numbers, and isomorphism. 

т Р қ 

The emphasis throughout lightly towards abstract algebra—tields, groups, and vector spaces are 
defined in the course of the work—but these concepts are always carefully introduced and explained in 
concrete terms. There are many exercises for the student, some of which are marked as comprising 


results of particular importance. ай 

The author disclaims any pretensions to completeness, and detailed treatment of applications has 
not been attempted; from the statistical point of view it is unfortunate that he has not found т іо 
mention the multivariate normal distribution, multiple regression, or to gtve the Helmert айй aun 
spherical polar transformations explieitly. Nevertheless, this is an excellent book whieh deserves to 
become standard. в. T. BEAEEOWS 


Introduction to Functional Analysis. By А. E. TAYLOR. New York: John Wiley and 
Sons Inc.; London: Chapman and Hall Ltd. 1958. Pp. 423. 100s. 


Professor Taylor avowedly intends that this book shall help 
starts with a modest background consisting mainly is а е 
Analysis’, the additional necessary preparation regard mE is dm 
provided within the book. He has succeeded in attaming Ы в оа 
to use as a text accompanying early postgraduate training. he su | ice extend н. DE 
well illust rated by examples, the choice of which is not limited to the о! ^ ) 


1; won PH :lor's 5 i res 

P «Ge es and Summability Theory. I rofessor Taylor в personal interest 
dated) applications to sed for example, has provided a refreshing source of illu- 
in applications to 


known illustrations are provided by discussions of the 

Кы examples. f the Dirichlet Problem and Weyl's* Projection Method ‘in Potential Theory. 

functional analytic aspects о lance and cover à wide range of difficulty. Тһе more difficult questions, 

Problems appear ыссы pes estions, must frequently succeed in stimulating the reader to continue 

са accompanied MER the text. Most chapters begin with а ку shieli ila ап 

> tre: i ht intro я major theorems to follow. The 
осна сети and an indication of the re 1 


the beginner їп Е unetional Analysis who 
d Honours Degree course in *elassical 
ar Spaces and General Topology being 
t, the book being admirably suited 


nee spac 


complex variable problems, 
Other comparatively little 


lations between. the т 
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b i гі һе 
in may nt wi У wor Ww specialis: it is in accord with t 

t always be that which would be av rarded by thes pecialist, but it is in 

i ota у: 

allocatio Y 


iti results resulting 
k. This same remark applies to the occasional repetition of fundamental res 
i is book. This в: | 
aim = emphasis оп normed linear Spaces. 
from 


i f the book. 
ing the specific contents o 1 ook, 
ыы = major portion of the basic equipme 
m dage by dealing with Linear Spaces and 
5 


Р ch 

it is fair to say that, apart from the theory af | 2 

nt of a functional analyst is provided, ( ае (Topo- 

General Topology respectively, whilst io further 

Some of the topics discussed in chapter 3 —— ial 

ion in chapter 4; in particular, Duality Theory appears in both chapters. € He a of linear 

E cce en Theorems for linear functionals and operators, the deeper proper Me emi 

Jh Bep "n momorphism, Inversion and Closed Graph T hi : jm aro 
erators. Applications of the Riesz Convexity Theoret 


Вы rem for 

Speetral Decomposition 'I nar Тойу 
354% кз y brielly 

ounded self-adjoint operators is only 


n. ion of 
4 eer ET eee А гїп 
Chapter 7, entitled "Integration and Linear Funetionals ‘is perhaps the least satisfac tory po 
the book. It provides a guide to existing accounts, w 
This reviewe 


of the topics under discussion, 
wise full reward will be reserved (perhaps not unjustly) for 
the reading Suggested. Abstra 


idon measures are all «c 
with, albeit rather summarily, as also are bounded, finitely-additive set functions and their integrals 
The typography is excellent, and no misprints were found. 
ne may confidently expect + i ill fi vell deserved place 
of many students and teachers of F 
Possible t 


Analysis, particularly so in th 


» knowledge 
the reader who already has some O 
ould be largely superfluous anyway. 


4 Follow up 
the keen reader who is prepared to foll 
measures апа В: 


¿shelves 
on the personal bagra 11% 
е former case if it shoulc 


s 
R. E, EDWARD 


‘opra, С.Е: 

(Royal Society Mathematical Tables, Vol. 4.) By H. GUPTA, p : 

WYTHER and J, С. р. Miter, Cambridge University Press for the Royal Society 
1958. Pp, Хххіх +132, 635 

Tn spite of their 4; 


rime 
er) in terms of both circulators "ae o 
no i йөр? i scrip 

computation of the tables and MPtotis formulae ig given followed by а deseri[ | 
ІП, 

lues of p.n, m) [Table 11 ; p(n, m) [Table 

н ‚рр. 90-121]; p(n, d 
to be expressed, [rii a ар. 132], tabulated Overa complicated range of es not briefly 

virtue of such relations ag tons may be used Successively t, 


" in 
9 check and extend the earlier tables 
pn,m) = Dts 1 


r=0 


Poln, т) = У Пер 


a — rm — 
r=0 


"рум тт ir(r4 12), 


(+ 1),r), 
Partitions of n, bei Spondingly lar 


© 
© 
У рап, т) "s 1902, t)}s—1 py P(n, m) t», 
n- 
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4 © со m 
where Фи = П 01-07). Урп,т)й = П a-r. 
r=1 n=0 r=1 


The tables are not likely to be of great use to statisticians, who will largely be well satisfied with much 
shorter earlier tables, but it is very heartening for probabilists to have such extensive tables for refer- 
ence and use in the rare combinatorial problems which demand them. 

D. E. BARTON 


Table for the Solution of Cubic Equations. By Н. E. SALZER et al. U.S.A. and 
London: MeGraw-Hill Book Co. Inc. 1958. Pp. xv +160. 58s. 


Тһе table gives the roots of the general cubic equation 


ax? + bar? + ex +d = 0 


by means of tabulating the roots of Ofs+f = 1 


as functions fi, fe Ja of 0. It is left to the reader to compute the argument 


0 = M(3a*d — abe + 908) (Зас — b°)? 


and convert the f-root back to the z-root by means of 
w = – ра —f(8ad — be + $09 /a) (Зас — 0%). 


t is Ü and the interval is 0-001. The whole range of 0 іѕ thus 
igures, and first and second differences are also given. When 
they are tabulated as 2 + if (being con- 
slightly 


For |0| > 1 the argument is 1/0; otherwise i 
covered. The entry is to 7 and 8 significant f 
two of the roots are complex, as they are outside ( — 4/27, 0), 
jugato). This is in spite of the fact that — 22 equals the real root and the imaginary part is only 
less easily derived. 

The introductory matter te 
everywhere’. It is suggested that the tables w 
third-order interpolation, but the reviewer can conce 
paying the prohibitive price. р/н. BARTON 


IIs us that ‘linear or quadratic interpolation will yield full aceuracy nearly 
ill be commonly useful in what amounts to inverse 
ive of no use for them at all which would justify 


Surveys in Applied Mathematics. New York: John Wiley and Sons Inc.; London: 


Chapman and Hall Ltd. 1958. 

Vol. 1. Elasticity and Plasticity. By 
50s. 

Vol. 2. Dynamics and Nonlinear Mechanics. 
Рр. xiit+ 206. 62s. 

Vol. 3. Mathematical Aspects of Subsonic and Tr 


Lipman Bers. Рр. xv+164. 508. - 
Vol. 4. Some Aspects of Analysis and Probability. By IRVING КАРГАМЗКУ, MARSHALL 


HALL JR., EDWIN Hewitt and ROBERT FonrET. Pp. хї+ 243. 798. 
Vol. 5. Numerical Analysis and Partial Differential Equations. 


Жонвутнв and Paur С. Возкмвьоом. Pp. х--204. 60s. 
i joi oject of the Office of Naval Research and Applied 
y геу articles ге been written as a joint project of ice а 1 and 
foots нес: = ayia are aimed *...not so much at research specialists, actively contributing to the 
Б ve anios Rev dna ad at а broader, mathematically literate audience, looking for contemporary in- 
fer Jer өрт ее important problems and results in these disciplines’. A major objective is to present 
mation on the 1 pant 


T ian journals. 
те ts appearing in Russian Jou а ы " 4 А KERN 
и к "cer be warned that the degree of mathematical literacy assumed cargos 
abl pue aa a ;hether any one person will have the energy, ability or inclination to а p ias 
= = 18 аа Ч ағ 1тен Certainly your reviewer is not competent to comment on the first three. 
ents of a ese v d 


J. М. GooprzR and P. G. Норак, Jr. Pp. ix + 152. 
By E. LzrwANIS апа N. MINORSKY. 


ansonic Gas Dynamics. By 


By GEORGE E. 
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bvious temptation confronting the author of such a surve 
b uem undue emphasis on b ow 
23 е will necessarily be out ofc ate 
Tin references later than 1956. A 
ticles most interesting to Statisticians per se i ES ce pma cr 
аг Advances in Probability Theory’ (Fortet) in vol. 4, and * Contemporary \ 
pe x NA in vol. 5. The other articles in vol. 4, namely 
A ever ar Abstraet Harmonic Analysis’ (Hewitt) may 
a E 
theoretic. | 
Hall’s article on combinatorial 
The emphasis throughout is enti 
issue of Biometrika); neither 
little sympathy with method 
suffers greatly from compari 
distinct representatives, w 
(Existence and Constructi 
of the theory, 


ed picture 
y is to fail to give a impero e 
ee p ^c roided. Also 
n work; this temptation has not always been avoid 
3 this 


` very 
è * iis S there are ver) 
almost before they are published; in these volumes t 


5 ; (Hall) 
у геу of C i ial Anal ( 
are ʻA $ urvey of Combinatori disini 
“ХЕ insky) 
"Functional Analysis? (Кар 


es B TS measure- 
interest. specialists in matters n 


T work. 
analysis (68 PP.) gives only three glancing references iv Mey last 
rely different from that of Riordan's recent book (гехі ш Тев уній 
Hall nor Riordan give üny references to the other's рарогя, B нала деді 
s using generating functions; his first section (Methods of pei on 
son with Riordan's book, His second part is devoted to е ‚ final part 
ith its generalizations and some applications: the discussion HE , er ?) parts 
on of Designs) concentrates on the more ole zant (and less practi 


ыа 
f согу of stochastic eme 
K on axiomatic ions, central limit theorems ұс м, 
ms for densities), “general random elements’, func Mihi 
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